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(57) Abstract 

A virus, or virus-like partide. derived from a virus or virus- 
like parade having a receptor Tor a host cell co m pri sin g a modified 
binding specificity conferred by a binding moiety allowing the virus 
or virus-like partide to bind to a target cell characterised in that the 
said host cell receptor is modified or absent so that the virus or vi- 
rus-like partide is substantially incapable of binding the said host 
cell are disdosed, An adenovirus or influenza virus or vaccinia vi- 
rus, or a replication defective derivative of any of these, character- 
ized in that the virus has a modified binding spedfichy conferred by 
a binding moiety allowing the virus to bind to a target cell are dis^ 
dosed. Suitable binding moieties indude monodonal antibodies, 
ScFvs, dAbs and minimal recognition units. The use of at least some 
of these as delivery vehides for genes to target cells in the fidds of 
erne therepy and cancer treatment are disdosed. 



binds 






HI 



Host ojU(Hacstd) 



Tanjnt cstt (not 



Heat aril (not 
) 



o- O 



Tarant csU 
Orrtscted) 




Hoi 
o-To] 



Has* csU (rat 
infected) 



/ 



FOR THE PURPOSES OF information only 



,. podes mod to identify Stales parry to the PCT on the from pages of pamphlets publishing international 
applications under the PCT. r b 



AT 


Awarla 


CB 


All 


Awn !!■ 


CK 


M 


Bafwados 


CN 


as 




Ct 


ar 


Burkina FOOD 


mi 


BC 


Buiaa/ti 


a 


BJ 


Benia 


fT 


aa 


Brazil 


j» 


BY 


Belarus 


KI 


CA 


Cusad* 


KC 


CF 


Central African Republic 


K> 


CC 


Coaao 




CH 


Swtuertand 


III 


a 


Cote dlvoire 


KZ 


CM 


< ■■rrniMi 


U 


CN 


China 


UC 


CS 




Ul 


CZ 


Cast* Republic 


LV 


DC 


Gcreaany 


MC 


DK 


Denmark 


MO 


ES 


Spaia 


MC 


PI 


Finland 


«t 


n 


France 


MN 


CA 


Gabon 





Uahed t«i(rtn» 
Coo 

Ck.ii 



Huaaary 

IrtsiMd 

Itaiy 



Kyrfjnuu 

rWin mil People '■ Republic 
of Lore* 

Republic al Kara* 



Sri I 

Luna 
Ulna are 

Rcpublk of Moldova 
Mali 



MR 


Maurtiaaia 


MW 


Malawi 


NC 


Nicer 


NL 


NctlicTaaUids 


NO 


Norway 


HZ 


New Zealand 


PL 


Poland 


rr 


Porunjal 


RO 


Roaiaaia 


Rtl 


Ruanan Federation 


so 


Judas 


ss 


Swodea 


SI 


Slovenia 


SK 


Slovakia 


SN 


Senegal 


TO 


Chad 


TC 


Togo 


TJ 


Tajik ittaa 


TT 


Trinidad aad Tobago 


UA 


Ukraine 


US 


Unlied Sulci ol Ame 


uz 


Urhriinan 


VN 


Viet Nam 



WO 94/10323 



PCT/GB93/02267 



1 

VIRUS WITH MODIFIED BINDING MOIETY SPECIFIC FOR THE TARGET CELLS. 

The present invention relates to delivery vehicles for genes to target cells, 
especially in the fields of gene therapy and cancer treatment. 

5 

The delivery of genes to target cells, especially those within the 
mammalian body, has many uses, for example in the fields of gene 
therapy, cancer treatment and in areas of genetic manipulation still to be 
discovered. The gene to be delivered may encode a molecule, such as a 
10 protein or RNA, which is cytotoxic to the target cell, or it may encode a 
functional copy of a gene that is defective in the target cell. In this latter 
case the product of the aforementioned functional copy of the gene will 
replace that of the defective copy, and the target cell will be able to 
perform its proper function. 

15 

The use of viruses, or virus-like particles, to deliver genes for gene 
therapy and cancer treatment has been disclosed. 

However, in most cases the targeting of the virus or virus-like particles 
20 containing the desired gene to the cell has relied on the natural host-virus 
specificity or on local application of the virus to the cells to be targeted, 
for example direct application of viruses to lung cells by inhalation. 

The human adenovirus 5 (Ad5) genome consists of a double-stranded 
25 linear DN A molecule of 36 kilo-basepair. The virus replication cycle has 
two phases: an early phase, during which four transcriptional units £1, 
£2, £3, and £4 are expressed, and a late phase occurring after the onset 
of viral DN A synthesis when late transcripts are expressed from the major 
late promoter (MLP). These late messages encode most of the viral 
30 structural proteins. £1 , £2, and E4 gene products of human adenoviruses 
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(Ads) are involved in transcriptional activation, cell transformation, and 
viral DNA replication as well as other viral functions, and are essential for 
viral growth. In contrast, E3 gene products are not required for viral 
replication in cultured cells or for acute lung infection of cotton rats, but 
5 appear to be involved in evading immune surveillance in vivo. 

By "virus-like particle" we mean a nucleoprotein particle containing a 
core of nucleic acid surrounded by protein which (i) is not infective and 
(ii) can only be propagated in a suitable cell system following 
10 transformation by its nucleic acid. Thus a virus-like particle of 
mammalian origin may be propagated in Saccharomyces cerevisiae or in 
insect cells via a baculo virus expression system. 

The modification of coat proteins of filamentous bacteriophages (bacterial 
15 viruses), such as M13 and fd, so as to generate novel binding properties, 
has been disclosed in Cwirla a al (1990) Proc. Natl. Acad. Sci. USA 87, 
6378-6382 and Scott & Smith (1990) Science 249, 386-390. 

It has previously been suggested that retroelement particles, including 
20 retroviral vectors, may be modified to target specific cells, for example 
see Kingsman ei al (1991) Tibuch 9, 303-309. 

Russell et al (1993) NucL Acids Res. 21, 1081-1085, published after the 
priority date for this application but before the filing date discloses 

25 retroviral vectors displaying functional antibody fragments and suggests 
that, in principle, the display of antibody fragments on the surface of 
recombinant retroviral particles could be used to target virus to cells for 
gene delivery. However, it is not known whether a retrovirus can be 
assembled in which all the subunits of the viral envelope protein are fused 

30 to antibody, and if so whether the virus would infect cells. 
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NIP-derivatised human cells were tested as a method for targeted gene 
delivery, but became permissive for both modified (displaying an anti-NIP 
antibody) and unmodified ecotropic viral particles. NIP is 4-hydroxy-3- 
iodo-5-nitrophenylacetic acid. 

5 

Michael et al (1993) J. Biol Chan. 268, 6866-6869, published after the 
priority date of this application but before the filing date, describes 
molecular conjugates between adenovirus and a vector system comprising 
two linked domains, a DNA binding domain and a ligand domain. In this 
10 configuration, however, it is stated that the viral moiety functions in the 
capacity of both an alternate ligand domain of the conjugate and, since an 
additional ligand has been introduced into the conjugate design, the 
potential for cell-specific targeting is undermined. 

15 C urid a al (1992) Human Gene Therapy 3, 147-154 describes 
adenoviruses wherein a foreign epitope was introduced into the hexon 
protein and polylysine-antibody complexed DNA was attached to 
adenovirus by virtue of the antibody binding the foreign epitope on the 
hexon. Foreign DNA is transferred bound to the exterior of the virion. 

20 

The above-mentioned viruses and virus-like particles may be able to target 
cells using the binding moiety displayed on their surface but they can also 
still target their natural host cells. 

25 We have now devised new viruses and virus-like particles at least some 
of which can bind the target cell with high specificity and may deliver 
genetic material to the target cell; at least some of the viruses and virus- 
like particles may bind and deliver genetic material to the target c el l 
without substantially binding to the natural host cell of the virus. 
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One aspect of the present invention provides a virus, or virus-like particle, 
derived from a virus or virus-like particle having a receptor for a host cell 
comprising a modified binding specificity conferred by a binding moiety 
allowing the virus or virus-like particle to bind to a target cell 
5 characterised in that the said host cell receptor is modified or absent so 
tfaat the virus or virus-like particle is substantially incapable of binding die 
said host cell. 



By "substantially incapable of binding its host cell" we mean that the 
10 modified virus has no more than 1% of the binding affinity of the 
unmodified virus for the host cell. 

In general, the binding specificity of a natural virus or virus-like particle 
is conferred by the specific interaction between a receptor-like molecule 
IS expressed on the surface of the virus or virus-like particle and a cognate 
receptor-like molecule expressed on the surface of its host cell. The 
invention provides a beneficial modification of the binding specificity, so 
- that the virus or virus-like particle can bind to a different specific target 
cell. 

20 

The introduction of the modified binding moiety may be such as to 
achieve the said removal of the native binding specificity. 

A second aspect of the invention comprises an adenovirus or influenza 
25 virus or vaccinia virus, or a replication-defective derivative of any of 
these, characterised in that the virus has a modified binding specificity 
conferred by a binding moiety allowing the virus to bind to a target cell. 

By "binding moiety" we mean a molecule that is exposed on the surface 
30 of the virus or virus-like particle which is able to bind to a molecule on 
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the target cell. The "binding moiety" may be a molecule on the virus or 
virus-like particle modified in such a way that its binding specificity is 
changed, or it may be a molecule added to, and exposed on the surface of, 
the virus or virus-like particle to provide a new binding specificity. 

5 — ^ 

It is preferred if the binding moiety is external to the receptor for its host 
cell of the naive, unmodified virus. 



10 



It is further preferred if the binding moiety is joined or fused to the virus 
or virus-like particles directly or indirectly by a spacer group. 



By "host cell" we mean the cell that an unmodified, naive virus can bind 
to using its receptor-like molecule and the cognate receptor-like molecule 
on the cell. By "target cell" we mean the cell that the modified virus can 
15 bind to using its binding moiety. In some circumstances in the context of 
the second aspect of the invention, such as when the binding moiety 
recognises an entity on the host cell which is not the cognate receptor-like 
molecule, then the host cell may be the target cell. 



20 The virus or virus-like particle may be a bacteriophage and the target cell 
a bacterium in which case the invention may find uses in the treatment of 
bacterial infections. 



In a preferred embodiment of the invention the target cell is cukaryotic. 
25 The eukaryotic cell may be a yeast cell and the virus or virus-like particle 
may be useful in the medical field in treating yeast infections such as 
athlete's foot or Can did a infection but it is preferred that the eukaryotic 
cell is mammalian, and it is expected that the invention will find uses in 
the areas of gene therapy and cancer treatment. 



30 
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In preferred embodiments of the first aspect of the invention the virus or 
virus-like particle is adenovirus or influenza virus or a pox-virus such as 
vaccinia. 

5 It is also preferred that the virus or virus-like particle is " replication- 
defective By "replication defective" we mean a virus whose genetic 
material has been manipulated so that it cannot divide or proliferate in the 
cell it infects. 

10 The binding moiety of the virus or virus-like particle of the invention / 
provides the target cell binding specificity. Any cell-binding protein or | 
peptide or carbohydrate or lipid may be useful for targeting the virus or 
virus-like particle to the cell. For example, short linear stretches of amino 
acids, such as those constituting a peptide hormone, are useful, as are 

15 domains of polypeptides that can fold independently into a structure that 
can bind to the target cell. 

In one preferred embodiment the binding moiety has the property of any ! 
one of a monoclonal antibody, ScFv (single chain Fv fragment), a dAb 
20 (single domain antibody) or a minimal recognition unit of an antibody. 

The binding site on the target cell may be a target cell-specific antigen. 
Such antigens are listed in Table 1 . Other binding moieties, targets on 
cells, and diseases which could usefully be treated using reagents delivered 
25 by the modified viruses or virus-like particles are given in Table 2. 
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T ab le 1 



1- Tumour Associated Anrip*™ 

Antigen Antibody 

Carcino-embryonic {C46 (Amersham) 
Antigen {85AI2 (Unipalh) 



Placental Alkaline 
Phosphatase 



H17E2 (ICRF, 
Travers & Bodmer) 



Existinp 

Imaging & Therapy 
of colon/rectum 
tumours. 

Imaging & Therapy 
of testicular and 
ovarian cancers. 



Pan Carcinoma NR-LU-10 (NeoRx Imaging & Therapy 

Corporation) of various 



carcinomas incl. 
small cell lung 
cancer. 



10 Polymorphic 

Epithelial Mucin 
(Human milk fat 
globule) 



HMFG1 (Taylor- 
Papadimitriou, ICRF) 



Imaging & Therapy 
of ovarian cancer, 
pleural effusions. 
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0-human Chorionic 
Gonadotropin 



W14 



Targeting of enzyme 
(CPG2) to human 
xenograft 

choriocarcinoma in 
nude mice. (Searie 
et al (1981) Br. J. 
Cancer 44, 137-144) 



10 



A Carbohydrate on 
Human Carcinomas 



L6 (IgG2a)' 



CD20 Antigen on B 
Lymphoma (normal 
and neoplastic) 



1F5 GgG2a) 2 



Targeting of «llra1i™> 
phosphatase. (Senter 
et al (1988) PMA.S. 
85, 4842-4846 



Targeting of alkaline 
phosphatase. (Senter 
et al (1988) P.N.A.S. 
85, 4842-4846 
'Hellstrom et al (1986) Cancer Res. 46, 3917-3923 
^Clarke et al (1985) P.N.A.S. 82, 1766-1770 

Other antigens include alphafoetoprotein, Ca-125 and prostate specific 
antigen. 



2. Immune Cell Antigens 

15 Pan T Lymphocyte OKT-3 (Ortho) 
Surface Antigen 
(CD3) 



As anti-rejection 
therapy for kidney 
transplants. 
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B-lymphocyte 
Surface Antigen 
(CD22) 

Pan T lymphocyte 
Surface Antigen 
(CDS) 



Hepatitis B Surface 
Antigen 



RFB4 (Janossy, 
Royal Free Hospital) 

H65 (Bodmcr, 
Knowles ICRF, 
Licensed to Xoma 
Corp., USA) 



Anti HBs Ag 



Immunotoxin therapy 
of B cell lymphoma. 

Immunotoxin 
treatment of Acute 
Graft versus Host 
disease, Rheumatoid 
Arthritis. 



Antibody conjugated 
to Diphtheria toxin 
for treatment of 
mumps. 

Immunotoxin against 
Hepatoma. 



>• Infectious Apent-Relateri Antics 

Mumps virus-related Anti-mumps 

polyclonal antibody 



Table 2; Binding moieties for tumour-speeifie t argets and * m m > 
associate^ flntigffnff 
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Target 



Binding moiety 



Disease 



10 



15 



20 



25 



Truncated EGFR 
Idiotypes 
EGFR (c-erbBl) 

c-erbB2 
IL-2 receptor 

IL-4 receptor 

IL-6 receptor 

MSH (melanocyte- 
stimulating hormone) 
receptor 

Transferrin receptor 
(TR) 

gp95/gp97 
p-glycoprotein cells 
cluster- 1 antigen (N- 
CAM) 
cluster-w4 

cluster-5A 



anti-EGFR mAb A 
anti-id mAbs 
EGF, TGFo 
anti-EGFR mAb * 
mAbs 
IL-2 

anti-Tac mAb '' 
IL-4 

IL-6 

a-MSH 



Transferrin 
anti-TR mAb 

mAbs 
mAbs 
mAbs 



mAbs 



\ 



mAbs 





cluster-6 (LeY) 


mAbs 




PLAP (placental 


mAbs 


30 


alkaline phosphatase) 






CA-125 


mAbs 




ESA (epithelial 


mAbs 


35 


specific antigen) 






CD 19, 22, 37 


mAbs 




250 kDa 


mAbs 




proteoglycan 






p55 


mAbs 


40 


TCR-IgH fusion 


mAbs 




Blood gp A antigen 


mAbs 




On B or O 






individuals) 





Gliomas 

B-cell lymphomas 
Breast cancer 

Breast cancer 
Lymphomas 
and leukaemias 
Lymphomas 
and leukaemias 
Lymphomas 
and leukaemias 
Melanomas 



Gliomas 



Melanomas 
drug-resistant 
Small cell lung 
carcinomas 
Small cell lung 
carcinomas 
Small cell lung 
carcinomas 
Small cell lung 
carcinomas 
Some seminomas 
Some ovarian; 
some non-small cell 
lung cancer 
Lung, ovarian 
carcinoma 

B-cell lymphoma 
Vfelanoma 

' Breast cancer 
Childhood T-cell 
eukaemia 
Gastric and colon 
tumours 
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The binding moiety may be a monoclonal antibody. Monoclonal 
antibodies which will bind to many of these antigens are already known 
but in any case, with today's techniques in relation to monoclonal antibody 
technology, antibodies can be prepared to most antigens. The binding 
5 moiety may be a part of an antibody (for example a Fab fragment) or a 
synthetic antibody fragment (for example, ScFv). Suitable monoclonal 
antibodies to selected antigens may be prepared by known techniques, for 
example those disclosed in "Monoclonal Antibodies: A manual of 
techniques", H Zola (CRC Press, 1988) and in "Monoclonal Hybridoma 
10 Antibodies: Techniques and Applications \ J G R Hurrell (CRC Press, 
1982). 

Suitably prepared non-human antibodies can be "humanized" in known 
ways, for example by inserting the CDR regions of mouse antibodies into 
15 the framework of human antibodies. 

The variable heavy (Vk) and variable light (VJ domains of the antibody 
are involved in antigen recognition, a fact first recognised by early 
protease digestion experiments. Further confirmation was found by 
20 "humanization" of rodent antibodies. Variable domains of rodent origin 
may be fused to constant domains of human origin such that the resultant 
antibody retains the antigenic specificity of the rodent parental antibody 
(Morrison et al (1984) Proc. NatL Acad Sci. USA 81, 6851-6855). 

25 That antigenic specificity is conferred by variable domains and is 
independent of the constant domains is known from experiments involving 
the bacteria] expression of antibody fragments, all containing one or more 
variable domains. These molecules include Fab-like molecules (Better et 
al (1988) Science 240, 1041); Fv molecules (Skerra et al (1988) Science 

30 240, 1038); ScFv molecules where the V H and V L partner domains are 
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linked via a flexible oligopeptide (Bird et al (1988) Science 242, 423; 
Huston et al (1988) Proc. Natl. Acad. Sci. USA 85, 5879) and dAbs 
comprising isolated V domains (Ward et al (1989) Nature 341, 544). A 
general review of the techniques involved in the synthesis of antibody 
5 fragments which retain their specific binding sites is to be found in Winter 
& Milstein (1991) Nature 349, 293-299. 

By "ScFv molecules" we mean molecules wherein the V H and V L partner 
domains are linked via a flexible oligopeptide. 

10 

It may be advantageous to use antibody fragments, rather than whole 
antibodies. Effector functions of whole antibodies, such as complement 
binding, are removed. ScFv and dAb antibody fragments can be 
expressed as fusions with other polypeptides. 

15 

Minimal recognition units may be derived from the sequence of one or 
more of the complementary-determining regions (CDR) of the Fv 
fragment. Whole antibodies, and F(ab')j fragments are "bivalent". By 
"bivalent" we mean that the said antibodies and F(ab')j fragments have 
20 two antigen combining sites. In contrast, Fab, Fv, ScFv, dAb fragments 
and minimal recognition units are monovalent, having only one antigen 
combining sites. 

In a further embodiment the binding moiety is at least part of a Iigand of 
25 a target cell-specific cell-surface receptor. 

It is preferred that the target cell-specific cell-surface receptor is the 
receptor for human gonadotropin releasing hormone (GnRH). In this 
preferred embodiment the binding moiety is GnRH, and its binding 
30 specificity is o for human cancer cells that express the GnRH receptors on 
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their surface. Examples of such human cancer cells are prostate, breast 
and endometrial cancer cells. 

It is also preferred that the target cell-specific cell-surface receptor is the 
5 receptor for melanocyte-stimulating hormone (MSH) which is expressed 
in high number in melanoma cells. In this preferred embodiment the 
binding moiety is MSH, and its binding specificity is for melanoma cells. 

It is also preferred that the target cell-specific cell-surface receptor is the 
10 receptor for somatostatin. 

Of course, the receptors for GnRH, MSH and somatostatin may 
themselves be target cell-specific antigens and may be recognised by 
binding moieties which have the property of any one of a monoclonal 
15 antibody, a ScFv, a dAb or a minimal recognition unit. Thus, although 
the binding site on the target cell may be a cell-surface receptor it may 
also act as a target cell-specific cell-surface antigen for recognition by the 
binding moiety. 

20 It will be appreciated by those skilled in the art that binding moieties 
which are polypeptides may be conveniently made using recombinant 
DNA techniques. The binding moiety may be fused to a protein on the 
surface of the virus or virus-like protein as disclosed below or they may 
be synthesized independently of the virus or virus-like particle, by 

25 expression from a suitable vector in a suitable host and men joined to the 
virus or virus-like particle as disclosed below. 

Nucleic acid sequences encoding many of the targeting moieties are 
known, for example those for peptide hormones, growth factors, cytokines 
30 and the like and may readily be found by reference to publicly accessible 
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nucleotide sequence databases such as EMBL and GenBank. Once the 
nucleotide sequence is known it is obvious to the person skilled in the art 
how to make DNA encoding the chosen binding moiety using, for 
example, chemical DNA synthetic techniques or by using the polymerase 
5 chain reaction to amplify the required DNA from genomic DNA or from 
tissue-specific cDNA. 

Many cDNAs encoding peptide hormones, growth factors, cytokines and 
the like, all of which may be useful as binding moieties, are generally 
10 available from, for example British Biotechnology Ltd, Oxford, UK. 

It is preferred that when the virus or virus-like particle of the invention 
binds to its target cell it delivers its nucleic acid to the said target cell, that 
is the target cell is infected by the virus or virus-like particle. Target 
IS cells, especially cancer cells, that are infected in this manner by the virus 
or virus-like particle may express viral molecules on their surface and may 
be recognised by the immune system and destroyed. Of course, other 
cytotoxic functions of the virus may also kill the cell. 

20 In one embodiment when the virus or virus-like particle is adenovirus, the 
E1B gene is substantially deleted or modified so that its gene product no 
longer interacts with the El A protein. El A protein stimulates apoptosis 
but normally its action is inhibited by E1B. Conveniently, the E1B gene 
is inactivated by insertion; preferably a cytotoxic gene, as defined below, 

25 is inserted at or near the E1B gene. 

El, E3 and a site upstream of E4 may be used as sites for insertion of 
foreign DNA sequences in the generation of recombinant adenoviruses for 
example see Berkner and Sharp (1984) Nucl. Acids Res. 12, 1925-1941; 
30 Chanda a a! (1990) Virology 175, 535-547; Haj-Ahmad and Graham 
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(1986) J. Virol. 57, 267-274; Saito et al (1985) J. Virol. 54, 711-719; all 
incorporated herein by reference. Since the upper size limit for DNA 
molecules that can be packaged into adenovirus particles is approximately 
105% of the wild-type genome only about 2 kb of extra DNA can be 
5 inserted without compensating deletions of viral DNA. Although El is 
essential for virus replication in cell culture, foreign DNA can be 
substituted for £1 sequences when the virus is grown in 293 cells which 
are transformed by Ad5 DNA and constitutively express £1 (Graham et 
al (1977) J. Gen. ViroL 36, 59-72, incorporated herein by reference). 
10 Several vectors having 1.9 kb deleted from E3 of Ad5 have been 
constructed without interfering with virus replication in cell culture 
(reviewed by Graham and Prevec (1992) in "Vaccines: New Approaches 
to Immunological Problems" R.W. Ellis (Ed.), Butterworth-Heinemann, 
Boston, MA, pages 364-390, incorporated herein by reference). Such 
15 vectors allow for insertion of up to 4 kb of foreign DNA. Recombinant 
adenoviruses containing inserts in E3 replicate in all Ad-permissive cell 
lines and a number of adenovirus vectors containing E3 inserts have been 
shown to express foreign genes efficiently both in vitro and in vivo 
(Berkner (1988) Biotechniques 6, 616-629; Chanda et al (1990) Virology 
20 175, 535-547; Dewar et al (1989) /. ViroL 63, 129-136; Graham (1990) 
Trends Biotechnol. 8, 85-87; Graham and Prevec (1992) in "Vaccines: 
New Approaches to Immunological Problems" R.W. Ellis (Ed.), 
Butterworth-Heinemann, Boston, MA, pages 364-390; Johnson et al 
(1988) Virology 164, 1-14; Lubeck et al (1989) Proc. Natl Acad. Sci. 
25 USA 86, 6763-6767; McDermott et al (1989) Virology 169, 244-247; 
Morin a al (1987) Proc. Nad. Acad. Sci. USA 84, 4626-4630; Prevec et 
al (1989) J. Gen. Virol. 70, 429^134; Prevec et al (1990) J. Inf. Dis. 161, 
27-30; Schneider et al (1989) J. Gen. Virol. 70, 417-427; Vemon et al 
(1991) J. Gen. Virol. 72, 1243-1251; Yuasa et al (1991) J. Gen. Virol. 
30 72, 1927-1934) all incorporated herein by reference. 
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Substantially replication-defective adenoviruses may be made by creating 
a deficiency of the E1A protein. Suitably this is achieved by deleting the 
E1A gene or by making mutations within the E1A gene that prevent 
expression of the El A protein. Examples of suitable mutations are 
5 deletions within the E1A coding region; nonsense mutations; and 
frameshift mutations. 

In further preference, the virus or virus-like particle is modified further 
to contain a gene suitable for gene therapy. 

10 

In one embodiment, the gene encodes a molecule having a directly or 
indirectly cytotoxic function. By "directly or indirectly" cytotoxic, we 
mean that the molecule encoded by the gene may itself be toxic (for 
example ricin; tumour necrosis factor; interleukin-2; interferon-gamma; 
15 ribonuclease; deoxyribonuclease; Pseudomonas exotoxin A) or it may be 
metabolised to form a toxic product, or it may act on something else to 
form a toxic product. The sequence of ricin cDNA is disclosed in Lamb 
et al (1985) Eur. J. Biochem. 148, 265-270 incorporated herein by 
reference. 

20 

For example, it would be desirable to target a DNA sequence encoding an 
enzyme using the virus or virus-like particle of the invention, the enzyme 
being one that converts a relatively non-toxic prodrug to a toxic drug. 
The enzyme cytosine deaminase converts 5-fluorocytosine (5FC) to 5- 

25 fiuorouracil (5FU) (Mullen et al ( 1 922) PNAS 89, 33); the herpes simplex 
enzyme thymidine kinase sensitises cells to treatment with the antiviral 
agent ganciclovir (GCV) or aciclovir (Moolten (1986) Cancer Res. 46, 
5276; Ezzedine et al (1991) New Biol 3, 608). The cytosine deaminase 
of any organism, for example E. coli or Saccharomyces cerevisiae, may 

30 be used. 
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Thus, in a preferred embodiment of the invention, the gene encodes a 
cytosine deaminase and the patient is concomitantly given 5FC. By 
"concomitantly", we mean that the 5FC is administered at such a time, in 
relation to the transformation of the tumour cells, that 5FC is converted 
5 into SFU in the target cells by the cytosine deaminase expressed from the 
said gene. A dosage of approximately 0.001 to 100.0 mg 5FC/kg body 
weight/day, preferably 0.1 to 10.0 mg/kg/day is suitable. 

Components, such as 5FC, which are converted from a relatively non- 
10 toxic form into a cytotoxic form by the action of an enzyme are termed 
"pro-drugs''. 

Other examples of pro-drug/enzyme combinations include those disclosed 
by Bagshawe et al (WO 88/07378), namely various alkylating agents and 
15 the Pseudomonas spp. CPG2 enzyme, and those disclosed by Epenetos & 
Rowlinson-Busza (WO 91/11201), namely cyanogenic pro-drugs (for 
example amygdalin) and plant-derived 0-glucosidases. 

Enzymes that arc useful in this embodiment of the invention include, but 
20 are not limited to, alkaline phosphatase useful for converting phosphate- 
containing prodrugs into free drugs; arylsulfatase useful for converting 
sulfate -containing prodrugs into free drugs; cytosine deaminase useful for 
converting non-toxic 5-fluorocytosine into the anti-cancer drug, 5- 
fluorouracil; proteases, such as serratia protease, thermolysin, subtilisin, 
25 carboxypeptidases and cathepsins (such as cathepsins B and L), that are 
useful for converting peptidc-containing prodrugs into free drugs; Ph- 
alanx! carboxypepndases, useful for converting prodrugs that contain D- 
amino acid substituents; carbohydrate-cleaving enzymes such as /S- 
galactosidase and neuraminidase useful for converting glycosylated 
30 prodrugs into free drugs; 0-lactamase useful for converting drugs 
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derivatized with 0-lactams into free drugs; and penicillin amidases, such 
as penicillin V a mi dase or penicillin G amidase, useful for converting 
drugs derivatized at their amine nitrogens with phenoxyacetyl or 
phenylacetyl groups, respectively, into free drugs. Alternatively, 
5 antibodies with enzymatic activity, also known in the art as abzymes, can 
be used to convert the prodrugs of the invention into free active drugs 
[see, e.g. R J Massey, Nature, 328, pp. 457-458 (1987)]. 

Similarly, the prodrugs of this invention include, but are not limited to, 
10 the above-listed prodrugs, e.g., phosphate-containing prodrugs, 
thiophosphate -containing prodrugs, sulfate -containing prodrugs, peptide- 
containing prodrugs, D-amino acid-modified prodrugs, glycosylated 
prodrugs, /S-lactam -containing prodrugs, optionally substituted 
phenoxyacetamide-containing prodrugs or optionally substituted 
15 phenylacetamide-containing prodrugs, 5-fluorocytosine and other 5- 
fluorouridine prodrugs which can be converted by the enzyme of the 
conjugate into the more active, cytotoxic free drug. Examples of 
cytotoxic drugs that can be derivatized into a prodrug form for use in this 
invention include, but are not limited to, etoposide, teniposide, 
20 adriamycin, daunomycin, carminomycin, aminopterin, dactinomycin, 
mitomycins, cis-platinum and cis-platinum analogues, bleomycins, 
esperamicins [see U.S. PaL No. 4,675,187], 5-fluorouracil, melphalan and 
other related nitrogen mustards. 

25 In a further embodiment the gene delivered to the target cell encodes a 
ribozymc capable of cleaving targeted RNA or DNA. The targeted RNA 
or DNA to be cleaved may be RNA or DNA which is essential to the 
function of the cell and cleavage thereof results in cell death or the RNA 
or DNA to be cleaved may be RNA or DNA which encodes an 

30 undesirable protein, for example an oncogene product, and cleavage of 
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this RNA or DNA may prevent the cell from becoming cancerous. 

Ribozymes which may be encoded in the genomes of the viruses or virus- 
like particles herein disclosed are described in Cech and Herschlag "Site- 
5 specific cleavage of single stranded DNA" US 5,180,818; Altman et al 
"Cleavage of targeted RNA by RNAse P" US 5,168,053, Cantin et al 
"Ribozyme cleavage of HTV-1 RNA" US 5,149,796; Cech et al "RNA 
ribozyme restriction endoribonucleases and methods", US 5,116,742; 
Been et al "RNA ribozyme polymerases, dephosphorylases, restriction 
10 endonucleases and methods, US 5,093,246; and Been et al "RNA 
ribozyme polymerases, dephosphorylases, restriction endoribonucleases 
and methods; cleaves single-stranded RNA at specific site by 
transcs ten fi cation", US 4,987,071, all incorporated herein by reference. 

15 In a still further embodiment the gene delivered to the target cell encodes 
an antisense RNA. 

By "antisense RNA" we mean an RNA molecule which hybridises to, and 
interferes with the expression from a mRNA molecule encoding a protein 
20 or to another RNA molecule within the cell such as pre-mRNA or tRNA 
or rRNA, or hybridises to, and interferes with the expression from a gene. 

Conveniently, a gene expressing an antisense RNA may be constructed by 
inserting a coding sequence encoding a protein adjacent a promoter in the. 
25 appropriate orientation such that the RNA complementary to mRNA. 
Suitably, the antisense RNA blocks expression of undesirable polypeptides 
such as oncogenes, for example ras, bet, sre or tumour suppressor genes 
such as p53 and Kb. 

30 It will be appreciated that it may be sufficient to reduce expression of the 
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undesirable polypeptide rather than abolish the expression. 



It will be further appreciated that DNA sequences suitable for expressing 
as antisense RNA may be readily derived from publicly accessible 
5 databases such as GenBank and EMBL. 

In another embodiment of the invention, the gene replaces the function of 
a defective gene in the target cell. 

10 There are several thousand inherited genetic diseases of mammals, 
including humans, that are caused by defective genes. Examples of such 
genetic diseases include cystic fibrosis, where there is known to be a 
mutation in the CFTR gene; Duchenne muscular dystrophy, where there 
is known to be a mutation in the dystrophin gene; sickle cell disease, 

15 where there is known to be a mutation in the HbA gene. Many types of 
cancer are caused by defective genes, especially protooncogenes, and 
tumour-suppressor genes that have undergone mutation. 

Thus, it is preferred that the virus or virus-like particle of the invention, 
20 which may be useful in the treatment of cystic fibrosis, contains a 
functional CFTR gene to replace the function of the defective CFTR gene. 
Similarly, it is preferred that the virus or virus-like particle of the 
invention, which may be useful in the treatment of cancer, contains a 
functional protooncogene, or tumour-suppressor gene to replace the 
25 function of the defective protooncogene or tumour-suppressor gene. 

Examples of protooncogenes are ras, src, bcl and so on; examples of 
tumour-suppressor genes are p53 and Rb. 



30 



By "gene" we mean a nucleic acid coding sequence that may contain 
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introns, or fragment thereof, or cDNA, or fragment thereof. 

It will be appreciated that the gene will be introduced into a convenient 
place within the genome of the virus or virus-like particle and will contain 
5 a promoter and/or enhancer element to drive its expression. 

It is preferred if the promoter and/or enhancer is selective for the cells to 
be targeted. Some examples of tissue or tumour specific promoters are 
given below but new ones are being discovered all of the time which will 
10 be useful in this embodiment of the invention. 

The tyrosinase and TRP-1 genes both encode proteins which play key 
roles in the synthesis of the pigment melanin, a specific product of 
melanocytic cells. The 5' ends of the tyrosinase and ryrosinase-related 
IS protein (TRP-1) genes confer tissue specificity of expression on genes 
cloned downstream of these promoter elements. 

The 5' sequences of these genes are described in Bradl, M. et al (1991) 
Proc. Natl Acad. Sci. USA 88, 164-168 and Jackson, I.J. et al (1991) 
20 Nucleic Acids Res. 19, 3799-3804. 

Prostate-specific antigen (PSA) is one of the major protein constituents of 
the human prostate secretion. It has become a useful marker for the 
detection and monitoring of prostate cancer. The gene encoding PSA and 
25 its promoter region which directs the prostate-specific expression of PSA 
have been described (Lund wall (1989) Biochem. Biophys. Res. Comm. 
161, 1 151-1159; Riegman et al (1989) Biochem, Biophys. Res. Comm. 
159, 95-102; Brawer (1991) Acta Oncol 30, 161-168). 

30 Carrinoembryonic antigen (CEA) is a widely used tumour marker, 
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especially in the surveillance of colonic cancer patients. Although CEA 
is also present in some normal tissues, it is apparently expressed at higher 
levels in tumorous tissues than in corresponding normal tissues. The 
complete gene encoding CEA has been cloned and its promoter region 
5 analysed. A CEA gene promoter construct, containing approximately 400 
nucleotides upstream from the translational start, showed nine times higher 
activity in the adenocarcinoma cell line SW303, compared with the HeLa 
cell line. This indicates that cw-acting sequences which convey cell type 
specific expression are contained within this region (Schrewe et al (1990) 
10 Mol. Cell. Biol. 10, 2738-2748). 

The c-erbB-2 gene and promoter have been characterised previously and 
the gene product has been shown to be over-expressed in tumour cell lines 
(Kraus et al (1987) EMBOJ. 6, 605-610). 

15 

The mucin gene, MUC1, contains 5' flanking sequences which are able 
to direct expression selectively in breast and pancreatic cell lines, but not 
in non-epithelial cell lines as taught in WO 91/09867. 

20 The binding moiety allowing the virus or virus-like particle to bind to a 
target cell may be a polypeptide or oligosaccharide or lipid or any other 
molecule capable of binding specifically to the target cell. 

It is preferred that the binding moiety is a polypeptide. 

25 

The molecule on the surface of the virus or virus-like particle to which the 
binding moiety is joined may be a polypeptide, oligosaccharide or lipid or 
any other molecule in the virus or virus-like particle coaL It is preferred 
that the molecule is a polypeptide. 



30 
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If the binding moiety and the molecule on the surface of the virus or 
virus-like particle are both polypeptides then they may be linked together 
by any of the conventional ways of cross-linking polypeptides, such as 
those generally described in O'Suilivan ex al Anal Biochem. (1979) 100, 
5 100-108. For example, the binding moiety may be enriched with thiol 
groups and the molecule on the surface of the virus or virus-like particle 
reacted with a Afunctional agent capable of reacting with those thiol 
groups, for example the N-hydroxysuccinimide ester of iodoacetic acid 
(NHIA) or N-succinimidyl-3-(2-pyridyldithio)propionate (SPDP). Amide 
10 and thioether bonds, for example achieved with m-maleimidobenzoyl-N- 
hydroxysuccinimide ester, are generally more stable in vivo than 
disulphide bonds. 

Other chemical procedures may be useful in joining oligosaccharide and 
IS lipids to other oligosaccharides, lipids or polypeptides. 

It is preferred that the binding moiety and the molecule on the surface of 
the virus or virus-like particle are both polypeptides that may be produced 
as a fusion by the techniques of genetic engineering. The use of genetic 
20 engineering allows for the precise control over the fusion of such 
polypeptides. 

Thus a further embodiment of the invention is a nucleotide sequence 
encoding the fusion of the binding moiety and the protein on the surface 
25 of the virus or virus-like particle. 

The nucleotide sequence encoding the fusion of the binding moiety and the 
protein on the surface of the virus or virus-like particle is preferably made 
by an alteration of the viral genome. 

30 
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The nucleotide sequence may be synthesised de novo using solid phase 
phosphoramidite chemistry, but it is more usual for the nucleotide 
sequence to be constructed from two parts, the first encoding the binding 
moiety and the second the protein on the surface of the virus or virus-like 
5 particle. The two parts may be derived from their respective genes by 
restriction endonuclease digestion or by other methods known by those 
skilled in the art such as the polymerase chain reaction. 

A variety of methods have been developed to operatively link two 
10 nucleotide sequences via complementary cohesive termini. For instance, 
synthetic linkers containing one or more restriction sites provide a method 
of joining the two DNA segment together. Each DNA segment, generated 
by endonuclease restriction digestion, is treated with bacteriophage T4 
DNA polymerase of E. coli DNA polymerase I, enzymes that remove 
15 protruding, 3'-single-stranded termini with their 3'-5*-exonucleolytic 
activities, and fill in recessed 3' -ends with their polymerizing activities. 

The combination of these activities therefore generates blunt-ended DNA 
segments. The blunt-ended segments are then incubated with a large 

20 molar excess of linker molecule* in the presence of an enzyme that is able 
to catalyze the ligation of blunt-ended DNA molecules, such as 
bacteriophage T4 DNA ligase. Thus, the products of the reaction are 
DNA segments carrying polymeric linker sequences at their ends. These 
DNA segments are then cleaved with the appropriate restriction enzyme 

25 and lifted to an expression vector that has been cleaved with an enzyme 
that produces termini compatible with those of the DNA segment. 

Synthetic linkers containing a variety of restriction endonuclease sites are 
commercially available from a number of sources including International 
30 Biotechnologies Inc. New Haven, CN, USA. 
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A desirable way to generate the DNA encoding the fusion polypeptide of 
the invention is to use the polymerase chain reaction as disclosed by Saiiri 
et al (1988) Science 239, 487-491 . 

5 In this method each of the DNA molecules encoding the two polypeptides 
to be fused are enzymatically amplified using two specific oligonucleotide 
primers which themselves become incorporated into the amplified DNA. 
The said specific primers may contain restriction endonuclease recognition 
sites which may then be used to join the said two DNA molecules using 
10 T4 DNA ligase as disclosed. 

A particular feature of one aspect of the present invention is the 
modification of the virus or virus-like particle of the invention so that it 
no longer binds its host cell and so that it binds the target cell by virtue 
15 of its binding moiety. 

The host-cell receptor of adenovirus may be the penton fibre and that of 
influenza virus may be the haemagglutinin receptor. 

20 These receptors may be modified by the insertion or deletion or 
substitution of amino acid residues that disrupt their host-cell binding 
function. It is preferred that the binding moiety for the target cell is 
joined to the host-cell receptor in such a manner mat the binding moiety 
is .capable of binding the target cell, the host-cell receptor is unable to 

25 bind to the host cell and therefore the binding specificity of the virus or 
virus-like panicle is modified. A further preference is that the portion of 
the host-cell receptor that is exposed on the surface of the virus or virus- 
like particle is replaced by the binding moiety, and that the portion of the 
host-cell receptor which promotes the uptake of viral DNA by the target 

30 cell is retained. Suitably, the binding moiety is joined directly or 
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indirectly to the host-cell receptor by a spacer group. 

Examples of spacer groups are polypeptide sequences of between 4 and 
1000 amino acid residues. 

5 

Thus, in one embodiment of the invention the gene encoding the penton 
fibre in adenovirus is modified in such a way that the DNA encoding the 
surface-exposed portion is replaced by a DNA fragment encoding a ScFv, 
the ScFv being derived from an antibody which binds to a target cell 
10 surface antigen. 

Potential fusion sites within the penton fibre have been identified. 

The adenovirus fibre is a trimer composed of three protomers. The amino 
15 terminal end (40 amino acids or so) of each participates in the formation 
of a tail that is closely associated with the penton (as opposed to the 
hexon) subunit of the capsid. High amino acid conservation is maintained 
between the different characterised serotypes. 

20 Middle portions of each protomer form the shaft of the protein. This shaft 
is of variable length, depending upon serotype, and is composed of 
repeating units of 15 amino acids (for examples, serotypes have been 
identified with 6, 15 and 21 repeat units). These repeating units are not 
duplicates: rather than strict conservation of amino acid structure, there 

25 is a general conservation of relative hydrophobicity. Some serotypes, for 
example, 40 and 41, have shafts composed of different length fibre 
proteins. This suggests a certain flexibility in structural constraints. 

The carboxy-terminal ends (some 200 amino acids) associate to form a 
30 knob that is held erect a great distance (in molecular terms) from the 
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capsid. 

Whilst the cellular receptor(s) and mechanisms of docking have not been 
firmly identified and elucidated, we propose that the most likely ranriiriatr 
5 structure for cell binding is the knob. Thus, in one embodiment the whole 
knob of the penton fibre has been replaced with single chain antibody 
(ScFv) domains. The triplex structure implies that each fibre will thus end 
in three ScFvs. Additionally, the ScFv regions can be replaced with 
CDRs, or by non-antibody derived peptides, of known specificity or other 
10 molecules that are capable of interacting specifically with the target cell. 

Suitable fusion sites are therefore at the native junction between shaft and 
knob domains, or (should the DNA sequence prove to be more amenable) 
at any junction between repetitive units of the shaft. Preferably, the 
IS minimum shaft length is not reduced beyond the smallest size naturally 
identified. There are thus at least IS potential sites at which fusion could 
be contemplated. 

Although it is preferred that the binding moiety forms the end of the fibre 
20 thereby replacing the knob, the binding moiety may also be fused within 
the penton fibre sequence but still display its binding surfaces and bind to 
the target cell. 

Suitably, the binding moiety may be fused to the knob and extend 
25 externally to the knob structure. 

In a further embodiment influenza virus haemmaglutinin is modified to 
incorporate a binding moiety. Influenza virus has seven or eight 
(depending on serotype) genetic segments, all negative strand RNA. 
30 Suitably, a cDNA from the whole segment encoding haemmagglutinin is 
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constructed and modified by adding a promoter firing backwards across 
this segment so that negative strand RNA is made. Genetic fusions with 
a suitable binding molecule, as disclosed above, are made using standard 
recombinant DNA methods and a suitable cell line is stably transfected 
5 with this gene construct Infection of this transfected cell line with 
influenza virus and selection of reasserted genomes containing the new 
haemmagglutinin by infection of a normally resistant cell line that 
expresses a marker that can only be recognised by the new 
haemmagglutinin yields the desired virus comprising modified cell-binding 
10 specificity. 

A further aspect of the invention provides a method of producing in cell 
culture a vims or virus-like particle and then joining the binding moiety, 
as defined above, to the virus or virus-like particle. 

15 

A further aspect of the invention provides a method of producing in cell 
culture a virus or virus-like particle which has been genetically modified 
to express a binding moiety on its surface. The virus or virus-like particle 
is grown in its host prior to modification, but once the modification that 
20 alters the binding specificity is made, the virus or virus-like particle is 
grown in the target cell. Thus, for example in the case where the binding 
moiety recognises a breast tumour cell antigen, the virus or virus-like 
particle is grown in breast tumour cell culture. 

25 The virus or virus-like particles of the invention are administered in any 
suitable way, usually parenterally, for example intravenously, 
intraperitoneaUy or intravesically, in standard sterile, non-pyrogenic 
formulations of diluents and carriers, for example isotonic saline (when 
administered intravenously). 



30 
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A further aspect of the invention provides a method of delivery of the 
virus or virus-like particle which contains a gene encoding a molecule 
having an indirectly cytotoxic function. 

5 Suitably, the indirectly cytotoxic function is an enzyme that converts a 
prodrug to a toxic drug. With such a virus or virus-like particle, once die 
virus or virus-like particle has bound to the target cells, delivered its 
nucleic acid to the cells, and expressed the indirectly cytotoxic functions, 
which typically takes a day or so, the pro-drug is administered. The 
10 timing between administration of the virus or virus-like particle and the 
pro-drug may be optimised in a non-inventive way. 

The dosage of the pro-drug will be chosen by the physician according to 
the usual criteria. The dosage of the virus or virus-like particle will 
15 similarly be chosen according to normal criteria, and in the case of tumour 
treatment, particularly with reference to the type, stage and location of 
tumour and the weight of the patient. The duration of treatment will 
depend in part upon the rapidity and extent of any immune reaction to the 
virus or virus-like particle. 

20 

Some of the viruses or virus-like particles either in themselves, or together 
with an appropriate pro-drug, arc in principle suitable for the destruction 
of cells in any tumour or other defined class of cells selectively exhibiting 
a recognisable (surface) entity. Examples of types of cancer that may be 
25 treated using the viruses or virus-like particles are cancer of the breast, 
prostate, colon, rectum, ovary, testicle and brain. The compounds are 
principally intended for human use but could be used for treating other 
mammals including dogs, cats, cattle, horses, pigs and sheep. 

30 The invention will now be described in detail with reference to the 



WO 94/10323 



PCT/GB93/02267 



30 

following Figures and Examples in which: 

Figure 1 shows (a) an unmodified (i.e. "naive") virus or virus-like particle 
able to bind to and infect its host cell but not a non-host cell, such as a 
5 target cell; and (b) a virus or virus-like particle with a modified binding 
specificity does not bind and infect its host cell but binds and infects a 
target cell; and (c) a virus or virus-like particle as in (b) modified further 
to contain a gene for gene therapy or cancer treatment. 

10 Figure 2 shows (a) unmodified (naive) adenovirus; (b) adenovirus 
modified so that its penton fibres, which recognise the host cell, are 
replaced in part by antibody fragments which recognise the target cell; and 
(c) adenovirus as in (b) with further genetic material added to the viral 
DNA for gene therapy of cancer. 

15 

Figure 3 shows (a) influenza virus and (b) genetically-modified influenza 
virus wherein at least part of the haemagglutinin binding site is replaced 
by an antibody with anti-cancer cell binding activity. 

20 Figure 4 shows (a) a retrovirus virus; and (b) as in (a) except the 
retrovirus has been modified further to express on its surface an anticancer 
cell-binding antibody fragment or an anticancer cell-binding peptide. 

Figure 5 is a diagrammatic representation of a penton fibre indicating 
25 potential fusion sites within the fibre. 

Figure 6 shows fusions between the DNA encoding the Ad5 fibre and an 
ScFv. 

30 Figure 7 shows sequences of oligonucleotides used for amplifying the 
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ScFv. All oligonucleotides are presented 5' to 3', the reverse complement 
of FOR primers are shown and derived amino acid sequences are shown 
where relevant. 

5 Figure 8 shows the construction of plasmid pRAS117. 

Figure 9 shows the nucleotide and derived amino acid sequence between 
the HindUI and EcoRI sites of pRAS117. 

10 Figure 10 shows a map of plasmid pRAS117. 

Figure 11 is a diagrammatic representation of the construction of plasmid 
pRAS118. 

15 Figure 12 shows the sequences of oligonucleotides for amplifying Ad5 
fibre DNA fragments. All oligonucleotides are presented 5' -» 3'. The 
reverse complements of FOR primers are shown. Derived amino add 
sequences are shown where relevant. 

20 Figure 13 shows the nucleotide sequence and deduced amino acid 
sequence between the HindJJl site and EcoRI site of pRASlll. 

Figure 14 gives a diagrammatic representation of constructing adenovirus 
carrying a cytotoxic gene. 

25 

Figure 15 gives the nucleotide and amino acid sequences of mouse and 
humanised HMFG1 variable regions. 
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Example 1: Fusion sites within the aripnnvirus Ad5 fibre for hinriinp 
moieties including single chain Fv (ScFvl 

The Ad5 DNA sequence co-ordinates used here are taken from: 
5 ADRCOMPGE.l: residues 1 to 32760 

and ADRCOMPGE.2: residues 32761-35935 

These can be accessed by using program SEQ on the Intelligenetics 
database. 

10 

The sequence of Ad5 fibre can also be found in Chroboczek, J. and 
Jacrot, B. (1987) "The sequence of adenovirus fiber. Similarities and 
differences between serotypes 2 and 5" Virology 161, 549-554 and is 
available from the EMBL Database, Heidelberg, Germany under accession 
15 name ADEFIB. 

Fusion sequences between the shaft and the ScFv are shown in Fig. 6. 
The fusion sites are at the junctions of the repetitive units of the shaft. 
Shaft sequences are shown in normal typescript; ScFv sequences are 
20 shown in italics. The DNA sequence between the Pstl and Xhol sites is 
unique to the ScFv used. 

Fusion A is at the end of the first repetitive unit of the shaft (co-ordinates 
31218-9), fusion B at the end of the second (31266-7), fusion C at the 

25 third (31323-4), fusion D at the fourth (31368-9), fusion E at the fifth 
(31413-4), fusion F at the sixth (31458-9), fusion G at the seventh (31503- 
4), fusion H at the eighth (31551-2), fusion I at the ninth (31596-7), 
fusion J at the tenth (31641-2), fusion K at the eleventh (31692-3), fusion 
L at the twelfth (31737-8), fusion M at the thirteenth (31787-8), fusion N 

30 at the fourteenth (31836-7), fusion O at the fifteenth (31884-5), fusion P 
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at the sixteenth (31929-30), fusion Q at the seventeenth (31995-6), fusion 
R at the eighteenth (32040-1), fusion S at the nineteenth (32103-4), fusion 
T at the twentieth (32151-2), fusion U at the twenty-first (32199-200), and 
fusion V is at the end of the twenty-second repetitive unit of the shaft 
5 (32244-5), the junction between shaft and knob. 

Example 2: Preparation of adenovirus expressi ng an ScFv n n fa 

surface 

10 The genetically modified fibre is introduced into the Ad5 genome by: (a) 
replacing the fibre gene of plasmid pE4 with the modified fibre by 
standard recombinant DNA technology and (b) reconstituting the virus by 
recombination. 

15 pE4 is a plasmid containing the right hand half of the Ad5 genome, and 
which has served as the source of the Ad5 fibre gene that we have used. 
It was provided by Dr Keith Leppard, Biological Sciences, University of 
Warwick, Coventry, CV4 7AL who has supplied details of its structure. 
If it is introduced into mammalian cells that contain the remainder of the 

20 Ad5 genome, then it is possible to obtain recombinants containing the 
modification. Most human cell lines can be used for the recombination 
but HeLa cells are preferred. 

The plasmid pE4 is readily made in the following way. A derivative of 
25 pBR322 is made by digesting with iteNl and rejoining using Xhol linkers 
such that the BstNl fragment corresponding to positions 1442-2502 in the 
pBR322 sequence is removed. DNA from the adenovirus Ad5 strain 309 
described by Jones & Shenk (1979) Cell 17, 683-689 is isolated and 
deproteinated. This DNA is then ligated to ClcH linkers and cut with 
30 EcdBl and Clal. The Clal-EcoRI fragment corresponding to the region 
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of 76% of the Ad5 genome to the right hand end is isolated and cloned 
into the EcdBl-Odl sites of the above-mentioned pBR322 derivative to 
form pE4. 

5 Adenovirus Type 5 and HeLa cells are available from the American Type 
Culture Collection, 12301 Packlawn Drive, Rockvffle, MD 20852-1776, 
USA under accession numbers ATCC VR-5 and ATCC CCL-2. 



10 



20 



25 



Construction of plasmid pRAS117 



Oligonucleotide primers LEADHB ACK and LEADbFOR (Figure 7) were 
used for PCR-mediated amplification of the DNA segment extending from 
the HinSm site of plasmid pRASlll, over the Shine-Dalgarno sequence 
and the pelB leader sequence to the Pstl site in the ScFv. LEADbFOR 
15 directs the incorporation of a BglH. site immediately after the pelB leader 
sequence. DNA (100 ng) from plasmid pRASlll was subjected to 24 
rounds of amplification, (94"C, 1 min; 65°C, 1.5 min and 72°C, 2 min) 
in a 50 fd reaction volume containing 25 pmol of each primer, 250 mM 
of each dNTP, 67 mM Tris-HCl (pH 8.8), 17 mM (NH^S0 4 , 1.5 mM 
MgClj, 200 mg.ml-' gelatine and 5 units of Thermus aquaticus (Taq) 
polymerase (Cetus) overlaid with 25 ,d paraffin oil. After the reaction, 
oil was removed by extraction with 500 fd chloroform. The sample was 
loaded on a 2% agarose gel, and the amplified fragment was 
electrophoresed on to a piece of NA45 paper (Schleicher and SchueU). 
Bound DNA was subsequently eluted by immersion in 400 /d 1M Nad 
made in TE (10 mM Tris-HCl, pH 7.5, 1 mM EDTA) for 30 min at 
70°C. To this was added 800 /xl ethanol, and after incubation (2 h, - 
20°C) the DNA was collected by centrifugation. The pellet was taken up 
in 50 Ad T/E. 



30 
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One fifth (10 fd) of the purified amplified fragment was cut with the 
restriction enzymes Hindm and Pstl, in a total volume of 20 fd 50 mM 
Tris-HCl, pH 7.5, 10 MgC12, 100 mM NaCl, 1 mM dithic>erythreitol 
containing 10 units of each enzyme. After incubation (1 h, 37°C) the 
5 reaction was stopped by incubation at 70°C for 15 minutes. 

The trimmed amplified fragment was cloned between the Hindm and Pstl 
sites of pUC8, to generate plasmid pRAS117. 

10 Plasmid pUC8 (1 fig) was cut with Hindm and Pstl, in a total volume of 
20 50 mM Tris-HCl, pH 7.5, 10 MgCL,, 100 mM NaCi, 1 mM 
dithioerythreitol containing 10 units of each enzyme. After incubation (1 
h, 37°C) the reaction was stopped by incubation at 70°C for 15 minutes. 

15 The ligation reaction contained 1.5 /J of pUC8/flindin, Pstl and 3 fd of 
the amplified leaderAtfindlll, Pstl in a total volume of 15 fd containing 70 
mM Tris-HCl pH 7.5, 7 mM MgCl lf 0.7 mM rATP, 4 mM dithiothreitol, 
0.5 mg.ml-' BSA and 10 units of T4 DNA ligase. After incubation (2 h, 
at room temperature), the reaction was stopped by the addition of 1 fd 500 

20 mM EDTA, pH 8.0 and 14 /d H 2 0. 

This ligation mix was used to transform E. colt. 

An aliquot (5 fd) of this ligation mix was used to transform a 200 /d 
25 aliquot of commercially available competent E. coli K12 DH55,7aF (Life 
Sciences Inc). After incubation (30 min, 0°C), heat shock (2 min, 42°C), 
addition of 800 /d L-broth and recovery (37°C, 1 h), cells (100 fd) were 
spread on L-agar plates containing 100 Aig.ml* 1 ampicillin containing 50 
mM IPTG Osopropyl-0-D-galactopyranoside) and 100 fig.nl' 1 X-Gal (5- 
30 bromo-4K:hloro-3-indolyl-/5-D-galactopyranoside). Cells were grown 
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overnight at 37°C, and individual colonies were transferred to fresh L- 
agar/ampicillin plates. After 6 h growth, colonies were used to inoculate 
5 ml aliquots of L-broth containing 100 ug.ml' 1 ampicillin. These cells 
were grown overnight with shaking at 37°C, and used as a source of 
5 plasmid DNA. 

These cells were used as a source of plasmid DNA. 

Harvested cells were suspended in 360 fd of SET (50 mM sucrose, 10 
10 mM EDTA, 100 mM Tris-HCl, pH 7.5) containing 2 mg.mJ- 1 hen egg 
lysozyme, transferred to a 1 .5 ml microfuge tube, and diluted by addition 
of 300 /xl 10% Triton X-100. After floating on boiling water for 2 min 
and cooling for a further minute in ice/water, denatured cell debris was 
removed by centrifugation (14,000 x g, 20 min) in a microcentrifuge. 
15 The majority of the soluble remaining proteins were removed by addition 
of 300 pi 7.5 M ammonium acetate and centrifugation (14,000 x g, 10 
min). Nucleic acids were precipitated by addition of 720 fd cold (-20°Q 
isopropanol and centrifugation (14,000 x g, 10 min). After rinsing the 
pellets with ethanol and drying, DNA was solubilised in 60 /d TE 
20 containing 170 /xg.mT'RNase A. 

Restriction enzyme digestions on 5 y\ aliquots, using the enzymes HindUl 
and BglU identified which of these plasmids were pRAS117. The 
construction scheme is shown in Fig. 8. The nucleotide and derived 
25 amino acid sequences between the HindUl and EcoRI sites of pRAS117 
are shown in Fig. 9. A map of plasmid pRAS117 is provided in Fig. 10. 

The nucleotide sequence of the relevant portion of pRASl 1 1, between the 
Hindm site and £coRI, site is given in Figure 13. 

30 
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Construction of plasmid pRAS118 (Figure 11) 

The 130bp HinSm-Pstl fragment of pRAS117 was used to replace the 
corresponding fragment of pRASl 1 1 , to generate plasmid pRASl 18. An 
5 aliquot (2 fig) of pRASlll DNA was cut with HindUI and Psd in the 
conditions used previously, the large fragment was isolated by 
electrophoresis onto NA45 paper, as described previously, and the DNA 
was supended in 10 pi of TE. An aliquot (10 /d) of pRASl 17 DNA was 
cut with HindUI and Pstl in the conditions used previously, and the small 
10 fragment was isolated by electrophoresis onto NA45 paper, as described 
previously, and the DNA was suspended in 10 /d of TE. 

The isolated pRASl 1 UHindmPstl large fragment (1 .5 /d) and the isolated 
pRASl 17/HimSmPstl small fragment (3 pd) were mixed and ligaled in the 
15 conditions previously described. 

Transformations, colony handling and DNA preparations were as 
previously described. 

20 Restriction enzyme digestions on 5 pi aliquots, using the enzymes HindUI, 
Pstl and BgKl identified which of these plasmids were pRASl 18. This 
encodes a NIP-reactive ScFv with a Bgtn. cloning site immediately 
downstream of the pelB leader, suitable for inserting fragments of DNA 
from Ad5 fibre (and also suitable for fusion of any other desired fusion 

25 functions). 

Amplification of Ad5 fibre DNA fragments 

Fragments of DNA from Ad5 fibre were amplified by PCR using 
30 oligonucleotide TAILbBACK and oligonucleotide -FIBREPFOR, 
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FIBRE3FOR, FIBRE6FOR, FIBRE9FOR, FTORE12FOR, FIBRE 15FOR, 
FIBRE18FOR, FBRE21FOR or FIBRE22FOR. Oligonucleotide 
sequences can be found in Fig. 12. 

5 T ATT /TRACK directs the incorporation of a BgM site at the base of the 
fibre, and the FIBREnFOR series primers direct the incorporation of a 
Pari site at the junctions of repetitive shaft units 3-4 (FTBRE3FOR), 6-7 
(FEBRE6FOR), 9-10 (FIBRE9FOR), 12-13 (FIBRE12FOR), 15-16 
(FIBRE 15FOR), 18-19 (FIBRE18FOR), 21-22 (FIBRE21FOR), between 
10 unit 22 and the knob (FIBRE22FOR) or at the end of the knob sequence 
(FIBREPFOR). 



Fusion of fibre and ScFv 



15 The amplified segments of fibre are trimmed with BgM and Ps& and 
ligated between the BglU and Pstl sites of plasmid pRAS118. This gives 
a range of fusions under the transcriptional control of the T7 promoter. 
Colonies are recovered after transformation of a suitable E. coli strain, 
such as DH5, which does not permit expression of the fusions. 

20 

Screening 



Colonies containing candidates for fusion arc identified by restriction 
digestion of their plasmid DNAs. These candidate DNAs are used to 

25 transform a suitable E. coli strain, such as BL21 (DE3), that contains a 
chromosomal insertion of T7 polymerase under lac control. In these cells, 
induction of expression of T7 polymerase using the gratuitous inducer 
IFTG causes expression of the fusion proteins. Soluble NIP-reactive 
material is identified in colonies with correctly assembled fusions. The 

30 DNA of these is identified and the NIP-reactive ScFv derived from 
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pRASl 1 1 are replaced with a cell-binding ScFv. 
Replacing the flbre:ScFv in plasmid pE4 

5 There is a Hindm site approximately half-way along the fibre gene. 
Fusions with long fibres also contain this Hindm site. The fusion is 
introduced at this site. 

Recombination in vivo of plasmid pE4-ScFv with the adenovirus 
10 genome 

To obtain virus particles expressing the ScFv on the penton fibre suitable 
cells, such as 293 cells, are cotransfected with plasmid pE4-ScFv and 
plasmid pFG173 as described in Mittal a al (1993) Virus Res. 28, 67-90, 
incorporated herein by reference. Since neither pFG173 nor pE4-ScFv 
individually is able to generate virus progeny, on transfcction of 293 cells 
viable virus progeny are only produced by in vivo recombination between 
these two plasmids resulting in rescue of the penton fibre-ScFv fusion into 
the Ad5 genome. 

293 cells are human transformed primary embryonal cells available from 
the ATCC under accession number ATCC CRL 1573. 

The adenovirus particles made in this way express a NIP-binding ScFv on 
their surface. Such particles are useful in a two-step targeting approach 
wherein a target-cell specific binding moiety, such as those identified in 
Tables 1 and 2, are joined to NIP molecule and targeted to a cell. Once 
they have localized to the target cell within the patient, the adenovirus 
displaying NIP-binding ScFv is administered to the patient and binds to the 
30 NIP. 
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Example 3; — Insertion of a cytotoxic gene into the E3 r«yinn fjf 
adenovirus Ad5 

In preparation for rescue of the cytotoxic gene into the E3 region of Ad5, 
5 the cytotoxic coding sequences were first inserted into a ™«yttc 
containing the SV40 early promoter and poly A addition sequences to give 
plasmid pTOX as shown in Figure 14. 

To obtain virus with the cytotoxic gene and SV40 regulatory sequences in 
10 the E3 region, 293 cells are cotransfected with plasmids pTOX and 
pFG173 (Fig 14). The plasmid pFG173 is constructed from pFG140, an 
infectious plasmid containing the AdS dl309 genome in circular form by 
inserting a kan r gene at the EcdKL site as 75.9 m.u. as described in Grahm 
(1984) EMBO J. 3, 2917-2922 and Mitall ct al (1993) Virus Res. 28, 67- 
15 90. 

Since neither pFG173 nor pTOX individually is able to generate infectious 
virus progeny, on transfection of 293 cells viable virus progeny are only 
produced by in vivo recombination between these two plasmids resulting 
20 in rescue of the E3 insert into the Ad5 genome. 

Viral plaques obtained after co transfection are isolated and expanded in 
293 cells and viral DNA was analyzed on an agarose gel after digestion 
with HindKL. The structure of the desired Ad5 -cytotoxic gene 
25 recombinant is verified by the presence of diagnostic fragments. One 
recombinant is plaque purified and used for further study. 

Legend to Figure 14 



30 The plasmid pFG173 contains the entire Ad5 genome, except for a 3.2 kb 
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sequence spontaneously deleted between m.u. 75.9-84.9. Plasmids pTOX 
and pFG173 were used for cotransfection of 293 cells to rescue, by in vivo 
recombination, the cytotoxic gene flanked by SV40 regulatory sequences 
in the E3 region of Ad5. The resulting Ad5-cytotoxic gene recombinant 
5 was named Ad5-TOX. The relative positions of HindUI and Xbal 
restriction sites of the Ad5-TOX genome are shown. The position and 
orientation of the SV40 promoter, the cytotoxic gene, and the SV40 
polyadenylation signal are shown below. Solid bars: luciferase gene; 
open bars: SV40 promoter and SV40 polyadenylation signal; hatched 
10 bars: amp r and kan r genes. 

The cytotoxic gene is the cDNA for thymidine kinase. 

Other cytotoxic genes are inserted into the E3 region of Ad5 in an 
15 analogous manner. 



Example 4; Single chain Fv from the mouse mnnnHnn^ nnffl nrri y 

H MFG1 and humanised monoHnn q l antibody Hii HMFH] 

The nucleotide sequences encoding the V H heavy chains and V/e light 
chains of HMFG1 and Hu HMFG1 are shown in Figure 15 and arc given 
in Verhoeyen a at (1993) Immunology 78, 364-370, incorporated herein 
by reference. 



25 Legend to Figure 15 



Nucleotide and amino acid sequences of mouse and reshaped HMFG1 
variable regions, (a) Heavy chain variable region sequences for mouse 
and reshaped HMFG1 (Mo V„-HMFG1 and Hu V„-HMFG1); (b) mouse 
and reshaped light chain variable regions respectively (Mo V.-HMFG1 and 
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Hu V.-HMFG1). Amino acids numbering and definition of the CDR and 
framework regions are from Kabat et al (1987) Sequences of Proteins of 
Immunological Interest, Edn 4, US Dept of Health and Human Services 
Public Health Service, NIH, Bethesda, MD 20892, USA. 

5 

The methods described by Bird et al (1988) Science 242, 423 or Huston 
et al (1988) Proc. Natl Acad. ScL USA 85, 5879 are applied to the 
nucleotide sequences described in Figure 15 to generate genes encoding 
ScFv for HMFG1 and ScFv for Hu HMFG1. These genes are fused 
10 individually into the adenovirus penton fibre gene as described in 
Examples 1 and 2. 

The amino acid sequences of the V H and V L chains of H17E2 are disclosed 
in "Monoclonal antibodies - applications in clinical oncology", pages 37- 
15 43, 1991, A. A. Epenetos, ed.. Chapman & Hall, UK. 

Nucleotide sequences encoding the V H and V L chains are readily derived 
from the amino acid sequence using the genetic code and an ScFv can be 
made from the sequences using the methods of Bird et al or Huston et al 
20 as described above. 



Kev to Sequence Listing 



Name 


SEQ ID No. j 




Nucleotide 


Polypeptide 1 




Sequence 


Sequence \ 
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1 Fusion A 


i 


2 


1 Fusion B 


5 


A 

4 


1 Fusion C 


e 
J 


A* 

0 


I Fusion D 


1 

1 


o 

8 


5 


B Fusion £ 


o 
y 


10 | 


I Fusion F 


xi 


to 1 

12 




Fusion G 


10 


14 




Fusion H 


13 


16 




Fusion I 


1*7 
1 / 


18 


10 


Fusion J 


io 


20 


1 Fusion K 


zi 


ZZ 




Fusion L 




24 




Fusion M 


ZD 


26 




Fusion N 


Z/ 


28 


15 


Fusion O 


zy 


30 




Fusion P 


31 


32 




Fusion Q 


33 


34 




Fusion R 


35 


36 


I Fusion S 


in 

37 


38 


20 


Fusion T 


39 


40 




Fusion U 


41 


42 




Fusion V 


A^ 

43 


44 




Xho-Eco 


45 






LEADHBACK 


46 


- 


25 


LEADbFOR 


A^ 

47 


48 




pRAS117 


Af\ 

49 


50 




TAILbBACK 


SI 


52 


1 FIBRE3FOR 


53 


54 


1 FEBRE6FOR 


55 


56 


30 


FD3RE9FOR 


57 


58 




FIBRE 12FOR 


59 


60 




FIBRE 15FOR 


61 


62 




FIBRE 18FOR 


63 


64 




FIBRE21FOR 


65 


66 


35 


FIBRE22FOR 


67 


68 




FIBREPFOR 


69 


70 




pRASlll 


71 


72 




MoV H 


73 


74 




MoV, 


75 


76 


40 1 


HuV H 


77 


78 


I HuV « 


79 


80 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION : 



(i) APPLICANT! 

( inl SS^i. 11 "? 8 ^? 1 CanC8r Ra»«*rch Technology Limited 

(B) STREET: Sardinia House, Sardinia Street 

(C) CITY : London 

(E) COUNTRY: United Kingdom 
(D POSTAL COSE (ZIP)t WC2A 3NL 

(ii) TITLE OF INVENTION: Compounds to target cells 

(lii) NUMBER OP SEQUENCES : 80 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM T YPE; Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 (BPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CH ARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 
(ili) HYPOTHETICAL: NO 
(ili) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Adenovirus 

(B) STRAIN: Ad 5 

<ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..30 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 1: 

CCT CTA GTT ACC TCC AAT CTG CAG CTC CAG 
30 

Pro Leu val Thr Ser Asn Val Cln Leu Gin 

1 5 10 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Pro Leu Val Thr Ser Asn Val Cln Leu Gin 
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1 5 10 

(2) INFORMATION FOR SEQ ID NO: 3: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH i 30 baaa pairs 

(B) TYPE: nuelale acid 

(C) STRANDED NESS : doubla 
(0) TOPOLOGY: linaar 

(11) MOLECULE TYPE i DNA (genomic) 

(HI) H Y POT HE TICAL » NO 

(Hi) ANT I -SENSE i NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Adanovirua 

(B) STRAIN: AdS 

(i-x) FEATURE: 

(A) NAME /KEY : COS 

(B) LOCATION: 1..30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CTC TCT CTG GAC GAG CCC GTC CAG CTC CAG 
30 

Lau Sar Lau Aap Clu Ala Val Gin Leu Gin 

i5 10 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 Amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linaar 

(ii) MOLECULE TYPE: protain 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Lam Sar Lau Asp Glu Ala Val Gin Lau Gin 

1 5 10 

(2) INFORMATION FOR SEQ ID NO: 5: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 basa pairs 

(B) TYPE: nuclaic acid 

(C) STRAND ED NESS : doubla 

(D) TOPOLOGY: linaar 

(ii) MOLECULE TYPE: DNA (genomic) 
(Hi) HYPOTHETICAL: NO 
(ill) ANTI-SENSE: NO 

(vi) ORICINAL SOURCE: 

(A) ORGANISM: Adanovirua 

(B) STRAIN: AdS 

(ix) FEATURE: 
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(A) NAME /KEY I CDS 

(B) LOCATION: I.. 30 



<xi) SEQUENCE DESCRIPTION j SEQ ID NO: 5: 

CCT CTC AAA AAA ACC AAC CTG CAO CTC CAC 
30 

Pro Leu Lys Lys Thr Lys Val Cln Leu Gin 

1 5 10 



(2) INFORMATION FOR SEQ ID NOl 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH t 10 amino acids 

(B) TYPE: amino acid 
(O) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Pro Leu Lys Lys Thr Lya Val Cln Leu Gin 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 baa* pairs 

(B) TYPE: nuclaic acid 

(C) STRANDED NESS : double 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE t HO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Adenovirus 

(B) STRAIN: Ad 5 

(ix) FEATURE: 

(A) NAME /KEY: COS 

(B) LOCATION: 1..30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CCC CTC ACA GTT ACC TCA CTG CAC CTC CAG 
30 

Pro Leu Thr Val Thr Ser Val Cln Leu Gin 

15 10 



(2) INFORMATION FOR SEQ ID NO: 8: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 aalno acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linaar 

(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION J SEQ ID NO: 81 

Pro Lau Thr Val Thr Ser Val Gin Lau Cln 

1 5 io 

(2) INFORMATION FOR SEQ ZD NO» 9> 

(1) SEQUENCE CHARACTERISTICS x 

(A) LENGTH t 30 baa* paira 

(B) TYPE: nuclaic acid 

(C) STRANDEDNESSi double 
(O) TOPOLOGY i linear 

(11) MOLECULE TYPE i DMA (genomic) 

(111) HYPOTHETICAL: NO 

(111) ANTI-SENSE: NO 

(Vl) ORIGINAL SOURCE: 

(A) ORGANISM: Adenovirus 

(B) STRAIN: AdS 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION : 1..30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NOs 9: 

CCT CTA ATG GTC COS GGC CTC CAO CTG CAC 
30 

Pro Leu Met Val Ala Gly Val Gin Leu Cln 

15 io 



(2) INFORMATION FOR SEQ ID NO: 10: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: llnaar 

(11) MOLECULE TYPE: p retain 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 x 

Pro Lau Hat Val Ala Cly Val Cln Lau Cln 

15 io 

(2) INFORMATION FOR SEQ ID NO: 11: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 basa pairs 

(B) TYPE: nuclaic acid 

(C) STRANDED NESS : double 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL i NO 

(ill) ANTI -SENSE: NO 

(vl) ORIGINAL SOURCE: 

(A) ORGANISM: Adenovirus 
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(B) STRAIN: AdS 

(ix) FEATURE: 

(A) NAME/KEY: COS 

(B) LOCATION: 1..30 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: II: 

COG CTA ACC CTG CAC GAC CTC CAC CTG CAS 
30 

Pro Leu Thr Val Bis Asp Val Gin Leu Gin 

1 5 io 



(2) INFORMATION FOR SEQ ID NO: 12: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 12: 

Pro Leu Thr Val Bis Asp Val Gin Leu Gin 
1 5 io 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 basa pairs 

(B) TYPE: nuclaic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) H Y POT H ETICAL: NO 
(ili) ANTI -SENSE: NO 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Adenovirus 

(B) STRAIN: AdS 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION : 1..30 



(xi) SEQUENCE DESCRIPTION i SEQ ID NO: 13: 

CCC CTC ACA CTG TCA CAA CTC CAC CTG CAC 
30 

Pro Leu Thr Val Ser Clu Val Gin Leu Gin 

15 io 



(2) INFORMATION FOR SEQ ID NO: 14: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(11) MOLECULE TYPE i protain 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 14 » 

Pro Lau Thr Val Sar clu Val Gin Lau Gin 

IS 10 

(2) INFORMATION TOR SEQ ID HO I 15: 

(1) SEQUENCE CHARACTERISTICS I 

(A) LENGTH i 30 baaa pairs 

(B) TYPE i nuclalc acid 

(C) STRAND ED NESS i doabla 

(D) TOPOLOGY » llnaar 

(11) MOLECULE TTPZi DNA (ganomlc) 
(111) HYPOTHETICAL t NO 
(111) ANTI-SENSE t NO 

(vl) ORIGINAL SOURCE: 

(A) ORGANISM: Adenovirus 

(B) STRAIN: AdS 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..30 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 15: 

CTC ACC ACC ACC CAT ACC CTC CAC CTC CAC 
30 

Lau Thr Thr Thr Asp Sar Val Gin Lau Gin 

15 10 



(2) INFORMATION FOR SEQ ID NO: 16: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino aeida 

(B) TYPE: amino acid 
(D) TOPOLOGY: llnaar 

(11) MOLECULE TYPE: protaln 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Lau Thr Thr Thr Aap Sar Val Gin Lau Gin 

1 5 10 

(2) INFORMATION FOR SEQ ID NO: 17 t 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 baaa pairs 

(B) TYPE: nuclalc acid 

(C) STRAND ED NESS i double 

(D) TOPOLOGY: llnaar 

(11) MOLECULE TYPE: DNA (genomic) 
(111) HY POTHET ICAL: NO 



(111) ANTI -SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Adenovirus 

(B) STRAIN: Ad5 

(ix) FEATURE: 

(A) NAME/KEY: COS 

(B) LOCATIONS 1..30 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NOt 17 I 

OCT CTA ACT ACT CCC ACT CTC CA6 CTG CAC 
30 

Pro Lau Thr Thr Ala Thr Val Gin Lau Oln 

IS 10 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino aeida 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION t SEQ ID NO: 18: 

Pro Leu Thr Thr Ala Thr Val Gin Lau Gin 

15 io 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 baaa pairs 

(B) TYPE: nuclaic acid 

(C) STRAND ED NESS : double 

(D) TOPOLOCY: linear 

(11) MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NO 
(ill) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Adenovirus 

(B) STRAIN: AdS 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..30 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CCC ATT TAT ACA CAA AAT CTC CAC CTG CAG 
30 

Pro Xla Tyr Thr Gin Aen Val Gin Lau Gin 
15 10 



(2) INFORMATION FOR SEQ ID NO: 20: 

(1) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 10 amino acida 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE » protain 

(Xl) SEQUENCE DESCRIPTION 1 SEQ ZD NO: 20l 

Pro lis Tyr Thr Gin Xan Val Oln Leu Gin 

15 10 

(2) INFORMATION FOR SEQ ZD HO» 21a 

(1) SEQUENCE CHARACTERISTICS i 

(A) LENGTH i 30 base pairs 

(B) TYPE i nucleic acid 

(C) STRANDEDNBSS: double 

(D) TOPOLOGY i linear 

(11) MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NO 
(ill) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Adenovirus 

(B) STRAIN: AdS 

(Ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

CAT GTA ACA GAC CAC CTA GTC CAC CTC CAG 
30 

His Val Thr Asp Asp Leu Val Gin Uu Gin 

15 10 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

His Val Thr Asp Asp Leu Val Gin Leu Gin 

IS 10 

(2) INFORMATION FOR SEQ ID NO: 23: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOCY: linear 

(11) MOLECULE TYPE: DNA (genomic) 



(ill) HYPOTHETICAL: NO 
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(ill) AKTI-SEHSEi NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Adenovirus 

(B) STRAIN: Ad5 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GGT CTG ACT ATT AAT AAT CTO CAG CTG CAG 
30 

Gly Val Thr He Aan Aan Val Gin Lau Gin 

15 10 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acida 

(B) TYPE: amino acid 
(0) TOPOLOGY: linaar 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Gly Val Thr Ila Aan Aan Val Gin Lau Gin 

15 10 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 baaa pairs 

(B) TYPE: nuclaic acid 

(C) STRANDEDNESS : doubla 

(D) TOPOLOGY: linaar 

(ii) MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Adenovirus 

(B) STRAIN: AdS 

(lx) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 1 

GGT TTT GAT TCA CAA GGC CTG CAG CTG CAG 
30 

Gly Pha Aap Sar Gin Gly Val Gin Leu Gin 

15 10 



(2) INFORMATION FOR SEQ ID NO: 26: 
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(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 amino acids 

(B) TYPE* amino acid 
(O) TOPOLOGY: linear 

( 11) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ IS NO: 26: 

Gly Pha Asp Ser Gin Cly Val Gin Leu Gin 

15 10 

(2) INFORMATION FOR SEQ TO HO l 27 t 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 baaa palra 

(B) TYPE: nuclaic acid 

(C) STRANDEDNESS: doable 

(D) TOPOLOGY: llnaar 

(11) MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NO 
(ill) ANT I -SENSE: NO 

(▼1) ORIGINAL SOURCE: 

(A) ORGANISM: Adenovirus 

(B) STRAIN: Ad 5 

(Ijc) FEATURE i 

(A) NAME /KEY t CDS 

(B) LOCATION: 1..30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NOi 27: 

AGG ATT GAT TCT CAA AAC GTG GAG CTG CAC 
30 

Arg lie Asp Ser Gin Aan Val Gin Leu Gin 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 28: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH i 10 amino acida 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Arc He Aap Ser Gin Aan Val Gin Leu Gin 

15 10 

(2) INFORMATION FOR SEQ ID NO: 29: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (genomic) 
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(ill) HYPOTHETICAL: NO 
(ill) ANTI-SENSE: HO 

(Vi) ORICINAL SOURCE: 

(A) ORGANISM: Adenovirua 

(B) STRAIN j Ad5 

(Ix) FEATURE i 

(A) NAME /KEY: CDS 

(B) LOCATION* 1..30 



(xi) SEQUENCE DESCRIPTION: SEQ ID BO J 29: 

TTT GAT OCT CAA AAC CAA CTG CAG CTC CAC 
30 

Phe Asp Ala Oln Aan Gin Val Gin Leu Gin 

IS 10 



(2) INFORMATION FOR SEQ ID NO: 30: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 30: 

Pha Asp Ala Gin Aan Gin Val Gin Lau Gin 

15 10 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUEN CE CH ARACTERISTICS : 

(A) LENGTH: 30 baaa pair a 

(B) TYPE: nuclaic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (Genomic) 
(iii) H Y PO T HE TICAL: NO 
(Hi) ANTI-SENSE i NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Adenovirus 

(B) STRAIN: AdS 

(ix) FEATURE: 

(A) RAMS/KEY l CDS 

(B) LOCATION: 1..30 



(xi) SEQUENCE DESCRIPTION ; SEQ ID NO: 31: 

CTT TTT ATA AAC TCA GCC CTG CAG CTG CAG 
30 

Leu Pbe lie Aan Ser Ala Val Gin Leu Gin 

15 10 
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(2) INFORMATION FOR SEQ ID NOt 32: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION l SEQ ID NOi 32 1 

Leu Phe He Asn Ser Ala Val Gin La>u Gin 
15 10 

(2) INFORMATION FOR SEQ ID NO I 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 baa* pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Adenovirus 

(B) STRAIN: Ad5 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..30 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 33: 

TCA AAC AAT TCC AAA AAC CTC CAG CTC CAG 
30 

Ser Asn Asn Ser Lys Asn Val Gin Leu Gin 

15 10 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS I 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 34: 

Ser Asn Asn Ser Lys Asn Val Gin Leu Gin 

15 10 

(2) INFORMATION FOR SEQ ID NO: 35 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EONESS : double 

(D) TOPOLOGY: linear 
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(11) MOLECULE TYPE: DMA (genomic) 
(111) HYPOTHETICAL: NO 
(111) ANTI- SENSE: NO 

(▼1) ORIGINAL SOURCE: 

(A) ORGANISM: Adenovirus 

(B) STRAIN i AdS 

(lac) FEATURE t 

(A) HAKE /KEY: CDS 

(B) LOCATION: X..30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GGG TTC ATC TTT GAC GCT CTG CAS CTC CAS 
30 

Gly Leu Met Phe Asp Ala Val Gin Leu Gin 

IS 10 



(2) INFORMATION FOR SEQ ID NO: 36: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 1 

Gly Leu Met Phe Asp Ala Val Gin Lea Gin 

15 10 

(2) INFORMATION FOR SEQ ID NO: 37: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : double 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (genomic) 
(111) H Y P OT HE TICAL: NO 
(111) ANTI -SENSE: NO 

(▼1) ORIGINAL SOURCE: 

(A) ORGANISM: Adenovirus 

(B) STRAIN: Ad 5 

(ix) FEATURE I 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

CCT AAT CCA CCA AAC ACA CTC CAG CTG CAG 
30 

Pro Aan Ala Pro Asn Thr Val Gin Leu Gin 

IS 10 
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(2) INFORMATION FOR SEQ ID NOt 38: 

(i) SEQUENCE CHARACTERISTICS i 

(A) LENGTHS 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY t linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION I SEQ 10 NOi 38: 

Pro Aan Ala Pro Asn Thr Val Gin Leu Oln 

15 10 

(2) INFORMATION FOR SSQ 10 NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairi 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Adenovirus 

(B) STRAIN: Ad5 

(ix) FEATURE: 

(A) NAME /KEY i COS 

(B) LOCATION: 1..30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

CTA GAA TTT GAT TCA AAC GTG CAG CTG CAG 
30 

Leu Clu Phe Aap Ser Aan Val Gin Leu Gin 

15 10 



(2) INFORMATION FOR SEQ ID NO: 40: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 ami no acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Leu Glu Phe Asp Ser Aan Val Gin Leu Gin 

IS 10 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH i 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
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(D) topology i linear 
(11) MOLECULE TYPE: DNA (genomic) 
(HI) HYPOTHETICAL: NO 
(111) ANT2- SENSE: NO 

(vl) ORIGINAL SOURCE: 

(A) ORGANISM j Adenovirus 

(B) STRAIN: Ad5 

(ix) FEATURE i 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..30 



(xl) SEQUENCE DESCRIPTION: SEQ 10 NO: 41: 

CTT AGT TTT CAC AGC ACA GTG CAC CTG CAC 
30 

Leu Ser Phe Asp Ser Thr Val Gin Leu Gin 
15 io 



(2) INFORMATION FOR SEQ ID NO: 42: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Leu Ser Phe Asp Ser Thr Val Gin Leu Gin 

15 io 

(2) INFORMATION FOR SEQ ID NO: 43: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : double 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DMA (genomic) 
(111) HYPOTHETICAL: NO 
(ill) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Adenovirus 

(B) STRAIN: Ad 5 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

ATT GAT AAG CTA ACT TTC CTO CAC CTG CAC 
30 
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II* Asp Lys Leu Thr Leu Val Gin Leu Oln 

15 10 



(2) INFORMATION FOR SEQ ID HO( 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENG TH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION* SEQ ZD NO» 44 1 

lis Asp Lys Leu Thr Leu Val Gin Leu Gin 

15 10 

(2) INFORMATION FOR SEQ ID NO J 45 i 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : double 

(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: DNA (genomic) 
<iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Adenovirus 

(B) STRAIN: Ad5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

CTCGAGTAAT AAGAATTC 
18 

(2) INFORMATION FOR SEQ ID NO: 46: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NOt 46: 

ACCTAACCTT CCATCCAAAT TC 
22 

(2) INFORMATION FOR SEQ ID NO: 47: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 31 base pairs 

(B) TYPE: nuelaic acid 

(C) STRAND ED NESS : single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL! NO 

(iii) ANTI-SENSE: NO 



(ix) FEATURE I 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..27 



(xi) SEQUENCE DESCRIPTION: SEQ ID NOt 47: 

CCA CCC ATG CCC ACA TCT CAC CTC CAC ACCT 
31 

Pro Ala Mat Ala Arg Ser Gin Lau Gin 
1 5 



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acida 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Pro Ala Mat Ala Arg Sar Gin Lau Gin 

1 5 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 ba«a pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 40.. 132 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

AAGCTTGCAT GCAAATTCTA TTTCAAGGAG ACAGTCATA ATG AAA TAC CTA TTG 
54 

Mat Lya Tyr Lau Lau 

1 S 
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CCT ACS CCA GCC GCT CGA TTC TTA TTA" CTC CCT GCC CAA CCA GCC ATG 
102 

Pro Thr Ala Ala Ala Gly Lau Leu Lau Lau Ala Ala Gin Pro Ala Hat 

10 IS 20 

GCC AGA TCT CAG CTC CAC CTC CAC GGA TCC 
132 

Ala Arg Sar Gin Lau Gin Val Asp Cly Sar 

25 30 

(2) INFORMATION FOR SEQ ID NOs 50 t 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 31 amino acids 

(B) TYPE: amino acid 
(O) TOPOLOGY i linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Mat Lys Tyr Lau Leu Pro Thr Ala Ala Ala Gly Lau Lau Lau Lau Ala 

15 10 15 

Ala Gin Pro Ala Met Ala Arg Ser Gin Leu Gin Val Aap Gly Ser 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: Si: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 bee* pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

( ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: S..28 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

AGCT AGA TCT ATG AAG CCC CCA AGA COS 
28 

Arg Ser Met Lys Arg Ala Arg Pro 

1 5 

(2) INFORMATION FOR SEQ ID NO: S2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
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Arg Ser Met Lys Arg Ala Arg Pro 

1 5 

(2) INFORMATION FOR SEQ ID NO: S3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : ■ ingle 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPES ONA (genomic) 
(ill) HY POTHETI CAL t NO 
(ill) ANTI -SENSE: NO 



(lx) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..33 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

CCT CTC AAA AAA ACC AAG CAC GTG CAG CTC CAG CAGCCTGG 
41 

Pro Leu Lya Lya Thr Lya Gin Val Gin Leu Gin 

1 5 10 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acida 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Pro Leu Lya Lya Thr Lya Gin Val Gin Leu Gin 

15 10 

(2) INFORMATION FOR SEQ ID NO: 55: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY > linear 

(11) MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NO 
(ill) ANTI-SENSE: NO 



(lx) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2.. 34 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
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C CCC CTA ACC CTC CAC CAC CAG CTC CAG CTC CAG CACCCTCC 
42 

Pro Leu Thr Val Hia Asp Cln Val Gin Leu Gin 

IS 10 



(2) INFORMATION FOR SEQ 10 NOt 56 1' 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH t 11 amino acida 

(B) TYPE i amino acid 
(0) TOPOLOGY: linaar 

(li) MOLECULE TYPE i protein 

(xi) SEQUENCE DESCRIPTION i SEQ 10 NOt 56: 

Pro Leu Thr Val Hia Aap Gin Val Gin Leu Gin 

1 5 10 

(2) INFORMATION FOR SEQ ID NOt 57 t 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH t 41 base pairs 

(B) TYPE i nucleic acid 

(C) STRANDEDNESS i aingle 

(D) TOPOLOGY t linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL t NO 
(ill) ANTI-SENSE i NO 



(ix) FEATURE t 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..33 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

CCT CTA ACT ACT CCC ACT CAC CTC CAC CTC CAO CACCCTCG 
41 

Pro Leu Thr Thr Ala Thr Gin Val Cln Leu Cln 
IS 10 



(2) INFORMATION FOR SEQ ID NO: 58: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acida 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE t protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NOt 58: 

Pro Leu Thr Thr Ala Thr Cln Val Gin Leu Gin 

IS 10 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH t 41 base pairs 
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(B) TYPE! nuclaic add 

(C) STRANDEDNESS: aingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NO 
(ill) ANTI-SENSE: NO 



(lx) FEATURE t 

(A) HAKE /KEY: CDS 
(8) LOCATION i 1..33 



<xt) SEQUENCE DESCRIPTION: SEQ ZD NO: 59: 

OCT CTG ACT ATT AAT AAT CAC CTG CAG CTC CAC GACCCTCG 
41 

Gly Val Thr He Aan Aan Gin Val Gin Leu Gin 
15 10 



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acida 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 60: 

Gly Val Thr He Aan Asn Gin Val Gin Leu Gin 

15 10 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 baae paira 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : aingla 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOT HE TICAL: NO 
(iii) ANTI -SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..36 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

CCC TTT CAT CCT CAA AAC CAA CAC CTC CAC CTG CAC CAGCC 
41 

Pro Pha Asp Ala Gin Aan Gin Gin Val Gin Leu Gin 
15 10 
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(2) INFORMATION FOR SEQ ID NOt 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NOt 62: 

Pro Pha Asp Ala Oln Asn Gin Gin Val Gin Lau Gin 
15 10 

(2) INFORMATION FOR SEQ ID NOt 63 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 basa paira 

(B) TYPE: nuclaic acid 

(C) STRAND EDNESS : a ingle 

(D) TOPOLOCY: linear 

(ii) MOLECULE TYPE: DNA (ganomic) 
(iii) HYPOTHETICAL: NO 
(ill) ANTI-SENSE : NO 



(i_X) FEATURE! 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..33 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 1 

GGG TTC ATG TTT GAC CCT CAG CTG CAC CTC CAG CAGCC 
38 

Gly Leu Met Pha Aap Ala Gin Val Cln Leu Gin 

15 10 



(2) INFORMATION FOR SEQ ID NO: 64: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ 10 NOt 64: 

Gly Leu Met Phe Aap Ala Cln Val Cln Leu Gin 

IS 10 

(2) INFORMATION FOR SEQ ID HO: 65: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH i 40 baae pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(Hi) HYPOTHETICAL: NO 
(ill) ANTI -SENSE I NO 



(lx) FEATURE: 

(A) NAME/KEY: COS 

(B) LOCATION: 3.. 35 



{xi) SEQUENCE DESCRIPTION: SEQ IS NOj 65 1 

CC CTT ACT TTT CAC AGC ACA CAS CTC CAS CTC CAS CAGCC 
40 

Leu Ser Phe Asp Ser Thr Cln V*l Cln Leu Gin 

15 10 



(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Leu Ser Phe Asp Ser Thr Cln Val Cln Leu Gin 

IS 10 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: SO base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii). MOLECULE TYPE: DNA (genomic) 
(ili) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



( ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..4S 



(xi) SEQUENCE DESCRIPTION : SEQ ID NOi 67 : 

GGA AAC AAA AAT AAT CAT AAC CTA ACT TTC CAC GTC CAC CTC CAC 
45 

Sly Asn Lys Asn Asn Asp Lys Leu Thr Leu Gin Val Gin Leu Gin 

IS 10 15 

CAGCC 

SO 



(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: is amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLSCOLE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

Gly Asn Lya Aan Aan Aap Lya Leu Thr Leu Gin Val Gin Leu Gin 

15 10 is 

(2) INFORMATION TOR SEQ ID NO: 69 t 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 43 baae pairs 

(B) TYPE: nucleic acid 

(C) STRANSEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iil) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.. 17 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

CA TAC ATT GCC CAA CAA TAACACCTCC AGCTGCAGCA GCCTGG 
43 

Tyr He Ala Gin Glu 

1 5 



(2) INFORMATION FOR SEQ ID NO: 70: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Tyr He Ala Gin Glu 

1 S 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 658 baae pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iii) ANTI —SENSE : NO 
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(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 40.. 846 

(xi) SEQUENCE DESCRIPTION: SEQ ZD NO: 71: 

AACCTTGCAT CCAAATTCTA TTTCAACGAS ACACTCATA ATS AAA TAC CTA TTC 
54 

Nat Lye Tyr Leu Lara 

1 S 

CCT ACC CCA GCC OCT GGA TTC TTA TTA CTC OCT CCC CAA CCA CCC ATO 
102 

Pro Thr Ala Ala Ala Gly Lau Leu Leu Leu Ala Ala Gin Pro Ala Mat 
10 15 20 

GCC CAG CTG CAG CTC CAG CAG CCT GGG GCT GAG CTT GTG AAG CCT GGG 
150 

Ala Gin Val Gin Lau Gin Gin Pro Gly Ala Glu Leu Val Lye Pro Gly 

25 30 35 

GCT TCA GTG AAG CTG TCC TGC AAG GCT TCT GGC TAC ACC TTC ACC AGC 
198 

Ala Ser Val Lya Lau Sar Cys Lya Ala Ser Gly Tyr Thr Pbe Thr Sar 
40 45 50 

TAC TGG ATG CAC TGG CTG AAG CAG ACC CCT GGA CCA GCC CTT GAG TCC 
246 

Tyr Trp Met His Trp Val Lya Gin Arg Pro Gly Arg Gly Leu Glu Trp 
55 60 65 

ATT CCA AGG ATT CAT CCT AAT ACT CCT CGT ACT AAG TAC AAT GAG AAG 
294 

lie Gly Arg lie Aap Pro Aan Ser Gly Gly Thr Lya Tyr Aan Glu Lye 

70 75 80 85 

TTC AAG AGC AAC CCC ACA CTG ACT GTA GAC AAA CCC TCC ACC AGA GCC 
342 

Pha Lya Ser Lya Ala Thr Leu Thr Val Aap Lya Pro Ser Ser Thr Ala 
90 95 100 

TAC ATG CAG CTC ACC AGC CTC ACA TCT GAG GAC TCT GCC GTC TAT TAT 
390 

Tyr Mat Gin Leu Ser Ser Leu Thr Ser Glu Aap Ser Ala Val Tyr Tyr 

105 110 115 

TCT CCA ACA TAC CAT TAC TAC CCT AGT ACC TAC TTT CAC TAC TGG GCC 
438 

Cya Ala Arg Tyr Aap Tyr Tyr Gly Ser Ser Tyr Phe Aap Tyr Trp Gly 

120 125 130 

CAA GGG ACC ACG GTC ACC CTC TCC TCA GOT GGA GGC CGT TCA GGC GGA 
486 

Gin Cly Thr Thr Val Thr Val Ser Ser Gly Gly Cly Gly Ser Gly Gly 
135 140 145 

CGT GGC TCT GCC GCT CCC CCA TCC CAC CCT CTT CTG ACT CAG GAA TCT 
534 

Gly Gly Ser Gly Gly Cly Cly Ser Cln Ala Val Val Thr Gin Glu Ser 
150 155 160 165 

CCA CTC ACC ACA TCA CCT GCT GAA ACA CTC ACA CTC ACT TGT CCC TCA 
582 

Ala Lau Thr Thr Sar Pro Gly Glu Thr Val Thr Leu Thr Cya Arg Ser 
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170 175 180 

ACT ACT GGG GCT GTT ACA ACT ACT AAC TAT CCC AAC TGG GTC CAA CAA 
630 

Sax Thr Gly Ala Val Thr Thr Ser Asn Tyr Ala Aan Trp Val Gin Glu 

185 190 195 

AAA CCA GAT CAT TTA TTC ACT GGT CTA ATA GCT GGT ACC AAC AAC CCA 
676 

Lys Pro Aap Hla Lau Pha Thr Gly Lau Zla Gly Gly Thr Aan Asn Arc 

200 205 210 

GCT CCA GGT GTT CCT CCC ACA TTC TCA CCC TCC CTC ATT CCA GAC AAC 
726 

Ala Pro Cly Val Pro Ala Arg Phe Sar Gly Ser Lau Zla Gly Asp Lya 
215 220 225 

GCT CCC CTC ACC ATC ACA CCC CCA CAG ACT GAG GAT GAG CCA ATA TAT 
774 

Ala Ala Lau Thr lis Thr Cly Ala Gin Thr Glu Asp Glu Ala Zla Tyr 
230 235 240 245 

TTC TGT CCT CTA TCC TAC ACC AAC CAC TCC CTC TTC CCT CCA CCA ACC 
822 

Pha Cya Ala Leu Trp Tyr Ser Asn His Trp Val Phe Gly Gly Gly Thr 

250 255 260 

AAA CTC ACT GTC CTA CCT CTC CAC TAATAACAAT TC 
858 

Lys Leu Thr Val Leu Cly Leu Glu 

265 

(2) INFORMATION FOR SEQ ZD NOt 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Cly Leu Leu Leu Leu Ala 

15 10 15 

Ala Gin Pro Ala Mat Ala Gin Val Gin Leu Cln Gin Pro Cly Ala Glu 

20 25 30 

Leu Val Lya Pro Gly Ala Ser Val Lys Leu Ser Cys Lya Ala Ser Gly 
35 40 45 

Tyr Thr Phe Thr Sar Tyr Trp Met His Trp Val Lys Cln Arg Pro Gly 

50 55 60 

Arg Gly Leu Glu Trp lie Cly Arg lie Asp Pro Asn Ser Cly Gly Thr 
65 70 75 80 

Lys Tyr Asn Clu Lys Phe Lys Ser Lys Ala Thr Leu Thr Val Asp Lys 

85 90 95 

Pro Ser Ser Thr Ala Tyr Met Gin Leu Ser Ser Leu Thr Ser Glu Aap 

100 105 110 

Ser Ala Val Tyr Tyr Cys Ala Arg Tyr Asp Tyr Tyr Cly Ser Ser Tyr 
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IIS 120 125 

Phe Aap Tyr Trp cly Gin Cly Thr Thr Val Thr Val Ser Ser Oly Gly 
130 135 140 

Gly Gly Ser Gly Gly Gly Gly Ser Gly Cly Gly Gly Ser Gin Ala Val 
145 150 155 160 

Val Thr Gin Clu Ser Ala Leu Thr Thr Ser Pre Gly Glu Thr Val Thr 
165 170 175 

Leu Thr Cye Arg Ser Ser Thr Cly Ala Val Thr Thr Ser Aen Tyr Ala 

180 185 190 

Aen Trp Val Gin Glu Lye Pro Asp His Leu Phe Thr Gly Leu Zle Gly 
195 200 205 

Gly Thr Asn Asn Arg Ala Pro Gly Val Pro Ala Arg Phe Ser Gly Ser 
210 215 220 

Leu lie Gly Aap Lys Ala Ala Leu Thr Zle Thr Gly Ala Gin Thr Glu 
225 230 235 240 

Asp Glu Ala He Tyr Phe Cys Ala Leu Trp Tyr Ser Aen His Trp Val 
245 2S0 255 

Phe Cly Gly Cly Thr Lys Leu Thr Val Leu Gly Leu Glu 
260 265 

(2) INFORMATION FOR SEQ ID NO: 73: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EONESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(ill) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mouse 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..354 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

CAG CTT GAG CTG CAG CAG TCT CCA CCT GAG CTG ATG AAG CCT GGG CCC 
48 

Gin Val Gin Leu Gin Gin Ser Gly Ala Clu Leu Met Lye Pro Cly Ala 

IS 10 IS 

TCA GTG AAG ATA TCC TGC AAG CCT ACT GGC TAC ACA TTC ACT CCC TAC 
96 

Ser Val Lys He Ser Cys Lys Ala Thr Cly Tyr Thr Phe Ser Ala Tyr 

20 25 30 

TGG ATA GAG TGG GTA AAG CAG AGG CCT GGA CAT GGC CTT GAG TGC ATT 
144 

Trp He Clu Trp Val Lys Gin Arg Pro Cly Hie Gly Leu Glu Trp He 
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35 40 4S 

CCA GAG ATT TTA CCT GGA ACT AAT AAT TCT AGA TAC AAT GAG AAC TTC 
192 

Gly Glu Zle Leu Pro Gly Ser Asn Asn Ser Arg Tyr Asn Olu Lys Phe 

50 55 60 

AAC CCC AAG GCC ACA TTC ACT CCT GAT ACA TCC TCC AAC ACA CCC TAC 
240 

Lys Gly Lys Ala Thr Phe Thr Ala Asp Thr S«r Ser Asn Thr Ala Tyr 

65 70 75 80 

ATG CAA CTC ACC ACC CTO ACA TCT GAG CAC TCT GCC GTC TAT TAC TOT 
288 

Mat Gin Leu Ser Ser Leu Thr Ser Glu Aap Ser Ala Val Tyr Tyr Cya 

85 90 95 

TCA AGG TCC TAC GAC TTT GCC TCC TTT GCT TAC TGG GCC CAA GGG ACT 
336 

Ser Arg Ser Tyr Aap Phe Ala Trp Phe Ala Tyr Trp Gly Gin Gly Thr 

100 105 110 

COG GTC ACT GTC TCT CCA 
354 

Pro Val Thr Val Ser Ala 

115 

(2) INFORMATION -FOR SEQ ID NO: 74: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

(zl) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Gin Val Gin Leu Gin Gin Ser Gly Ala Glu Leu Met Lye Pro Gly Ala 

15 10 15 

Ser Val Lye Zle Ser Cya Ly» Ala Thr Gly Tyr Thr Phe Ser Ala Tyr 

20 25 30 

Trp lie Glu Trp Val Lys Gin Arg Pro Gly His Gly Leu Glu Trp Zle 
35 40 45 

Gly Glu Zle Leu Pro Gly Ser Aan Aan Ser Arg Tyr Aan Glu Lya Phe 

SO 55 60 

Lys Gly Lys Ala Thr Phe Thr Ala Aep Thr Ser Ser Asn Thr Ala Tyr 

65 70 75 80 

Met Gin Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr Cys 

65 90 95 

Ser Arg Ser Tyr Aap Phe Ala Trp Phe Ala Tyr Trp Gly Gin Gly Thr 

100 105 110 

Pro Val Thr Val Ser Ala 
115 

(2) INFORMATION FOR SEQ ID NO: 75: 
(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 342 baa* pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 

(ill) ANTI-SENSE i NO 

(vi) ORIGINAL SOURCE I 

(A) ORGANISM: Mouse 

(i_x) FEATURE: 

(A) NAME /KEY: COS 

(B) LOCATION: 1..342 

(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 75: 

GAC ATT CTG ATG TCA CAG TCT CCA TCC TCC CTA CCT GTG TCA GTT CCA 
48 

Aap Ila Val Met Ser Gin Ser Pro Ser Ser Leu Ala Val Sar V&l Gly 
15 10 IS 

GAG AAG GTT ACT ATG ACC TCC AAG TCC ACT CAG AGC CTT TTA TAT ACT 
96 

Glu Lye V&l Thr Met Ser Cya Lys Ser Ser Gin Ser Leu Leu Tyr Ser 
20 25 30 

AGC AAT CAA AAG ATC TAC TTC GCC TGG TAC CAG CAG AAA CCA GGG CAG 
144 

Sar Aan Gin Ly» lie Tyr Leu Ala Trp Tyr Gin Gin Lya Pro Gly Gin 
35 40 4.5 

TCT CCT AAA CTG CTG ATT TAC TCC CCA TCC ACT AGC CAA TCT GGG GTC 
192 

Ser Pro Lys Leu Leu lie Tyr Trp Ala Ser Thr Arg Glu Ser Gly Val 

50 55 60 

CCT GAT CCC TTC ACA GCC GCT CGA TCT GGG ACA GAT TTC ACT CTC ACC 
240 

Pro Asp Arg Phe Thr Gly Gly Uly Ser Cly Thr Aap Phe Thr Leu Thr 
65 70 75 80 

ATC AGC ACT CTG AAG GCT CAA GAC CTG CCA CTT TAT TAC TCT CAG CAA 
288 

lie Sar Sar Val Lys Ala Glu Aap Leu Ala Val Tyr Tyr Cya Gin Gin 

85 90 9S 

TAT TAT AGA TAT CCT CCG ACC TTC GCT GGA GGC ACC AAG CTG GAA ATC 
336 

Tyr Tyr Arg Tyr Pro Arg Thr Pha Gly Gly Gly Thr Lya Leu Glu Ila 
100 105 110 

AAA CCG 
342 
Lya Arg 



(2) INFORMATION FOR SEQ ID NO: 76: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 114 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76t 

Asp He Val Met Ser Gin Ser Pro Ser Ser Leu Ala Val Ser Val Gly 
1 5 10 15 

Glu Lye Val Thr Met Ser Cys Lye Ser Ser Gin Ser Leu Leu Tyr Ser 

20 25 30 

Ser Aan Gin Lys He Tyr Leu Ala Trp Tyr Gin Gin Lye Pro Gly Gin 
35 40 45 

Ser Pro Lye Leu Leu He Tyr Trp Ala Ser Thr Arc Glu Ser Gly Val 
50 55 60 

Pro Asp Arg Phe Thr Gly Cly Gly Ser Gly Thr Asp Phe Thr Leu Thr 
65 70 75 80 

He Ser Ser Val Lye Ala Glu Asp Leu Ala Val Tyr Tyr Cya Gin Gin 
85 90 95 

Tyr Tyr Arg Tyr Pro Arg Thr Phe Gly Gly Gly Thr Lye Leu Glu He 

100 105 no 

Lys Arg 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 baa* pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : double 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NO 
(ill) ANTI -SENSE: NO 

(ix) FEATURE : 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..354 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77 1 

CAG GTG CAG CTG CTC CAG TCT GGG GCA GAG GTG AAA AAC CCT GGG GCC 
48 

Gin Val Gin Leu Val Cln Ser Cly Ala Glu Val Lys Lys Pro Cly Ala 

15 10 15 

TCA GTG AAG GTG TCC TGC AAG OCT TCT GGC TAC ACC TTC ACT GCC TAC 
96 

Ser Val Lys Val Ser Cys Lys Ala Ser Cly Tyr Thr Phe Ser Ala Tyr 

20 25 30 

TGC ATA GAG TCG GTG CCC CAC CCT CCA CGA AAG GGC CTC GAG TGC CTC 
144 

Trp He Glu Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val 
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35 40 45 

GOA CAS ATT TTA CCT CCA ACT AAT AAT TCT ASA TAC AAT GAG AAC TTC 
192 

Gly Clu lie Leu Pro Gly Ser Aan Asn Sar Arg Tyr Aen Glu Lya Phe 
50 55 60 

AAC GGC CCA CTC ACA CTC ACT ASA GAC ACA TCC ACA AAC ACA CCC TAC 
240 

Lys Gly Arg Val Thr Val Thr Arg Asp Thr Sar Thr Asn Tbr Ala Tyr 
65 70 75 60 

ATG GAG CTC AGC ACC CTC AGG TCT GAG GAC ACA CCC CTC TAT TAC TCT 
288 

Met Clu Lau Sar Sar Lau Arg Sar Glu Aap Thr Ala Val Tyr Tyr Cya 

85 90 95 

CCA AGA TCC TAC GAC TTT CCC TCC TTT CCT TAC TCC GGC CAA GGG ACT 
336 

Ala Arg Ser Tyr Asp Phe Ala Trp Phe Ala Tyr Trp Gly Gin Gly Thr 
100 105 110 

CTG GTC ACA GTC TCC TCA 
354 

Lau Val Thr Val Sar Ser 

115 

(2) INFORMATION FOR SEQ ID NO: 78t 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 118 amino aclda 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(li) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Gin Val Gin Lau Val Cln Ser Cly Ala Clu Val Lya Lya Pro Gly Ala 

1 S 10 15 

Sar Val Lya Val Ser Cya Lya Ala Ser Gly Tyr Thr Phe Sar Ala Tyr 

20 25 30 

Trp lie Glu Trp Val Arg Cln Ala Pro Gly Lya Gly Lau Glu Trp Val 
35 40 45 

Gly Glu lie Leu Pro Cly Ser Asn Asn Ser Arg Tyr Aan Glu Lya Phe 
SO 55 60 

Lya Gly Arg Val Thr Val Thr Arg Asp Thr Ser Thr Aan Thr Ala Tyr 
65 70 75 80 

Mat Clu Lau Sar Sar Leu Arg Ser Clu Aap Thr Ala Val Tyr Tyr Cya 

8S 90 95 

Ala Arg Ser Tyr Aap Phe Ala Trp Phe Ala Tyr Trp Gly Gin Gly Thr 

100 105 110 

Lau Val Thr Val Ser Ser 

115 

(2) INFORMATION FOR SEQ ID NO: 79: 
(1) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 342 base paira 

(B) TYFE> nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: ONA (genomic) 
(ill) HYPOTHETICAL: NO 
(ill) ANTI-SENSE: NO 

(lx) FEATURE: 

(A) NAME /KEY: COS 

(B) LOCATION: 1.-342 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

GAC ATC CAG ATG ACC CAG AGC CCA AGC AGC CTG AGC CCC AGC CTG GGT 
48 

Asp II* Gin Mat Thr Gin Ser Pro Ser Ser Leu Ser Ala Sar Val Gly 

15 10 15 

GAC AGA GTG ACC ATC ACC TGT AAG TCC ACT CAG AGC CTT TTA TAT ACT 
96 

Aap Arg Val Thr lie Thr Cya Lya Ser Ser Gin Ser Leu Leu Tyr Ser 

20 25 30 

ACC AAT CAA AAG ATC TAC TTG GCC TGG TAC CAG CAG AAG CCA GGT AAG 
144 

Ser Asn Gin Lya lie Tyr Leu Ala Trp Tyr Gin Gin Lya Pro Gly Lye 
35 40 45 

OCT CCA AAG CTG CTG ATC TAC TGG CCA TCC ACT AGG CAA TCT GGT GTG 
192 

Ala Pro Lya Leu Leu lie Tyr Trp Ala Ser Thr Arg Glu Ser Gly Val 

SO 55 60 

CCA AGC AGA TTC AGC GGT ACC GGT AGC GGT ACC GAC TTC ACC TTC ACC 
240 

Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Phe Thr 

65 70 75 80 

ATC AGC AGC CTC CAG CCA CAG GAC ATC CCC ACC TAC TAC TGC CAG CAA 
288 

lie Ser Ser Leu Gin Pro Glu Aap lie Ala Thr Tyr Tyr Cya Gin Gin 

85 90 95 

TAT TAT AGA TAT CCT CCC ACC TTC CCC CAA CGG ACC AAG CTG CAA ATC 
336 

Tyr Tyr Arg Tyr Pro Arg Thr Phe Gly Gin Gly Thr Lya Val Glu lie 

100 105 110 

AAA CCT 
342 
Lya Arg 



(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 114 amino acids 
(8) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(11) MOLECULE TYPE: protein 

(XL) SEQUENCE DESCRIPTION: SEQ 10 NO: 80 t 

Aap Zle Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Sar Val Cly 

15 10 15 

Asp Arg Val Thr Zle Thr Cya Lya Sar Sar Gin Sar Lau Leu Tyr Sar 

20 25 30 

Sar Aan Gin Lys Zla Tyr Lau Ala Trp Tyr Gin Gin Lya Pro Gly Lya 
35 40 45 

Ala Pro Lya Lau Lau Zla Tyr Trp Ala Sar Thr Are Glu Sar Oly Val 

50 55 60 

Pro Sar Arg Pha Sar Gly Ser Gly Ser Gly Thr Aap Pha Thr Phe Thr 

65 70 75 80 

Zla Sar Ser Lau Gin Pro Glu Aap Zla Ala Thr Tyr Tyr Cya Gin Gin 
85 90 95 

Tyr Tyr Arg Tyr Pro Arg Thr Pha Cly Gin Gly Thr Lya Val Glu Zle 

100 105 110 



Lya Arg 
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77 

CLAIMS 

1. A virus, or virus-like particle, derived from a virus or virus-like 
particle having a receptor for a host cell comprising a modified 
binding specificity conferred by a binding moiety allowing the 
virus or virus-like particle to bind to a target cell rhnrnf*»ri yf d in 
that the said host cell receptor is modified or absent so that the 
virus or virus-like particle is substantially incapable of binding 
the said host cell. 

2. A virus or virus-like particle according to Claim 1 wherein the 
target cell is eukaryotic. 



3. A virus or virus-like particle according to Claim 2 that is an 
15 adenovirus, influenza virus, vaccinia virus, any other animal 

virus or replication-defective derivative of any of these. 

4. An adenovirus or influenza virus or vaccinia virus, or a 
replication defective derivative of any of these, characterized in 

20 that the virus has a modified binding specificity conferred by a 

binding moiety allowing the virus to bind to a target cell. 

5. A virus or virus-like particle according to any of Claims 1 to 4 
wherein the binding moiety is a monoclonal antibody, an ScFv, 

25 a dAb, or a minimal recognition unit of an antibody. 



6. 



A virus or virus-like particle according to any of Claims 1 to 4 
wherein the binding moiety is at least part of a ligand of a target 
cell-specific cell-surface receptor. 
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7. A virus or virus-like particle according to Claim 5 or 6 wherein 
the binding moiety recognises a target cell-specific surface 
antigen. 

5 8. A virus according to any one of Claims 1 to 7 wherein the 
binding moiety is joined to a molecule on the virus or virus-like 
particle other than the receptor for its host cell. 

9. A virus or virus-like particle according to any one of Claims 1 to 
10 7 wherein the binding moiety is joined to or forms part of the 

receptor on the said virus or virus-like particle for its host. 

10. A virus or virus-like particle according to Claim 8 wherein the 
said molecule on the surface of the virus or virus-like particle is 

15 a protein. 

11. A virus or virus-like particle according to Claim 6 wherein the 
target cell-specific cell-surface receptor is any one of GnRH 
receptor, MSH receptor and somatostatin receptor. 

20 

12. A virus or virus-like particle according to any one of Claims 1 to 
1 1 modified further to contain a gene suitable for gene therapy. 

13. A virus or virus-like particle according to Claim 12 wherein the 
25 gene encodes a molecule having a directly or indirectly cytotoxic 

function. 
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15. A virus or virus-like particle according to Claim 13 wherein the 
gene encodes an enzyme capable of converting a relatively non- 
toxic pro-drug into a cytotoxic drug. 

16. A virus or virus-like particle according to Claim 15 wherein the 
gene is either cytosine deaminase or thymidine kinase. 

17. A virus or virus-like particle according to Claim 12 wherein the 
gene overcomes a defect in a gene in the target cell. 

18. A virus or virus-like particle according to Claim 17 wherein the 
gene is any one of CFTR, dystrophin and haemoglobin A. 



19. A virus, or virus-like particle, containing nucleic acid, according 
15 to any one of Claims 1 to 15 wherein the said virus or virus-like 

particle is adapted to deliver the said nucleic acid to the target 
cell. 

20. A virus or virus-like particle according to Claim 1 wherein the 
20 said receptor comprises protein. 

21. A virus according to Claim 20 wherein the virus is influenza 
virus and the said receptor is the haemagglutinin receptor protein. 

25 22. A virus according to Claim 20 wherein the virus is adenovirus 
and the said receptor is the penton fibre protein. 



23. 

30 



A virus according to Claim 22 wherein the binding moiety is 
fused to the penton fibre protein at any one or more of the 
junctions of the repetitive units of the shaft. 
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24. A virus according to Claim 23 wherein the binding moiety is a 
ScFv. 

25. A virus according to Claim 24 wherein the ScFv binds to a 
5 tumour cell antigen. 

26. A virus or virus-like particle according to any one of Claims 1 to 
25 wherein the binding moiety is a polypeptide. 

10 27. A virus or virus-like particle according to Claim 26 when 
dependent on either of Claims 10 or 20 wherein the binding 
moiety is fused to the protein on the surface of the said virus or 
virus-like particle. 

15 28. A virus or virus-like particle according to any one of Claims 1 to 
27 for use in medicine. 

29. A nucleotide sequence encoding the fusion of the binding moiety 
and the protein on the surface of the virus or virus-like particle 

20 according to any one of Claims 23 to 25 and 27. 

30. A nucleotide sequence encoding the receptor modified as defined 
in Claim 8, wherein the receptor comprises a polypeptide 
backbone. 



25 



31. A nucleotide sequence defined in any of Claims 29 or 30 
additionally comprising the remainder of the genome of the virus 
or virus-like parti cL. 



30 32. 



A nucleotide sequence encoding a virus or virus-like particle 
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according to any one of Claims 1 to 27. 



33. A therapeutic system comprising a virus or virus-like particle 
according to Claim IS or 16 and a pro-drug. 

5 

34. A method for producing a virus or virus-like particle according 
to any of Claims 1 to 27 in cell culture, the method comprising 
(1) infecting the cells with the said virus or virus-like particle, (2) 
culturing the infected cells until the virus or virus-like particle 

10 reaches a sufficiently high titre, (3) harvesting and substantially 

purifying the virus or virus-like particle and (4) joining the 
binding moiety to the substantially purified virus or virus-like 
particle. 

15 35. A method for producing a virus or virus-like particle according 
to any of Claims 1 to 27 in cell culture, the method comprising 
(1) genetically modifying the virus or virus-like particle to 
produce a binding moiety, (2) infecting cells with the genetically 
modified virus or virus-like particle, (3) culturing the cells until 

20 the virus or virus-like particle reaches a sufficiently high titre and 

(4) harvesting and substantially purifying the genetically modified 
virus or virus-like particle. 

36. A pharmaceutical composition comprising a virus or virus-like 
25 particle according to any one of Claims 1 to 27 and a 

pharmaceutical carrier. 



30 



37. 



A method of treating a mammal having target cells to be 
destroyed, the method comprising administering the virus or 
virus-like particle according to Claim 13. 
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38. A method of treating a mammal having target cells to be 
destroyed, the method comprising (1) administering a virus or 
virus-like particle according to Claim 15 or 16, (2) allowing the 
virus or virus-like particle to bind to and deliver its nucleic acid 

5 to the target cell and (3) administering the said pro-drug. 

39. A method of treating a mammal having a defective gene, the 
method comprising administering the virus or virus-like particle 
according to Claim 17 or 18. 

10 
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Fusion A 

12 3 4 108 109 

P L V T S N V Q L Q L E ' * 

CCTCTAGTTACCTCCAATGTffCA£?j^SCaff. . .ScFv. . . CTCCAGTAA TA AGAA TTC 
31210 | PstI .. .Xhol EcoRI 



Fusion B 

12 3 4 108 109 

L S L D E A V Q L Q L E * * 

CTCTCTCTGGACGAGGCCGTGCAG£XGCAG. ..ScFv. . . CTCGAGTAATA AGAA TTC 
31260 I PstI .. .Xhol EcoRI 



Fusion C 

1 2 3 4 108- 109 

P L K K T K V Q L Q L E * * 

CCTCTCAAAAAAACCAAGGTGCAGCinC&C- • .ScFv. . . CTCGAGTAA TA AGAA TTC 
31310 31320 I PstI . . .Xhol EcoRI 



Fusion D 

12 3 4 108 109 

P L T V T S V 0 L Q L E ' * 

CCCCTCACAGTTACCTCAC7rGCAC7£Z2Gaff. - .ScFv. . . CTCGAGTAA TA AGAA TTC 
31360 I PstI .. .Xhol EcoRI 



Fusion E 

12 3 4 108 109 

P L M V A D V Q L Q L E * * 

CC7C7AATGGTCGCGGGCGTGCAGCXGCAG . . .ScFv. . . CICGAGTAATA AGAATTC 
31400 31410 I PstI .. .Xhol EcoRI 

Figure 6 (Page 1 of 5) 
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Fusion F 

12 3 4 108 109 

PLTVHD V Q L Q L E * 

CCGC7MlCCGTGCACGACGTGCAG£IG£&£. ..ScFv. . . CTCGAGTAATA AGAA TTC 
31450 | PstI .. .Xhol EcoRI 



Fusion G 

12 3 4 108 109 

P L T V S E V Q L Q L E * • 

CCCCTCACAGTGTCAGAAGTGCAG£TGGa£7. - .ScFv. . . CTCGA G TAA TA AGAA TTC 
31490 31500 I PstI .. .Xhol EcoRI 



Fusion H 

12 3 4 108 109 

L T T T D S V Q L Q L E ' * 

CTCACCACCACCGATAGCCrCCAGEZCeaff. - -ScFv. . . CTCGAGTAATA AGAA TTC 
31540 31550 I PstI .. .Xhol EcoRI 



Fusion I 

12 3 4 108 109 

P L T T A T V Q L Q L E • » 

CCTCTAACTACTGCCACTffrGCACCZffiCaG. • .ScFv. . . CTCGA G TAA TA AGAA TTC 
31590 I PstI .. .Xhol EcoRI 



Fusion J 

12 3 4 108 109 

P I Y T Q N V Q L Q L E * * 

CCCATTTATACACAAAATGTCCAGEICCaG. . .ScFv. . . CTCGA G TAA TA AGAA TTC 
31630 31640 I PstI .Xhol EcoRI 

Pigure 6 (Page 2 of 5) 
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Fusion K 

12 3 4 108 109 

H V T D D L V Q L Q L E * * 

CATG?AACAGACGACC7AGTGCAG£I££AG . . .ScFv. . .£I£GMTAATAAGAAX2£ 

31730 I Pstl . . .Xhol EcoRI 



Fusion L 

12 3 4 108 109 

G V T I N N V Q L Q L E * * 

GGTGTGACTATTAATAATGTGCAG!CZ£C&£« . .ScFv. . . CTCGA G TAA TA AGAA TTC 
31730 I Pstl .. .Xhol EcoRI 



Fusion M 

12 3 4 108 109 

G F D S Q G V Q L Q L E * * 

GGTTTTGATTCACAAGGCGT<7CA<7CZGGar7. . - ScFv. . . CTCGAG TAA TA AGAA TTC 
31780 I Pstl . . .Xhol EcoRI 



Fusion N 

12 3 4 108 109 

R I D S Q N V 0 L Q L E * * 

AGGATTGATTCTCAAAACCTCCAG.CHZCA£. . .ScFv. . . CTCGAG TAA TAA GAATTC 
31830 I Pstl .. .Xhol EcoRI 



Fusion 0 

12 3 4 108 109 

F 0 A Q N Q V 0 L Q L E * * 

TTTGATGCTCAAAACCAAGrGCAG£ZGC&G. . .ScFv. . . CTCC7AG TAATA AGAATTC 
31880 I Pstl .. .Xhol EcoRI 
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Fusion P 

12 3 4 
P FIHSAVQLQ. 
CTTTTTATAAACTCAGCCGTCCACiZrCCaff • 
31920 I PstI . 



108 109 

L E * • 

ScFv. . .£ZCGAGTAATAA£AAZZ£ 
.Xhol EcoRI 



Fusion Q 

12 3 4 

SNNSKNV0&0. 
TCAAACAATTCCAAAAACCTCCAGEZSaG. 

32030 320401 PstI . 



108 109 
L E * * 

ScFv. . . rrrnA ci taa taa kaa rrr 
. Xhol EcoRI 



Fusion R 

12 3 4 
GLMFDAV0L0. 
GGGTTGATGTTTGACGCTffrffCAG£IGCA£T. 
32030 32040 1 PstI . 



108 109 

L E * * 

SCFV. . . CTCnAn TAATA AGAATTr 

.Xhol EcoRI 



Fusion S 

12 3 4 
PNAPNTVQLC. 
CCTAATGCACCAAACACAff TGCAG£ZG£A£ . 

32100 I PstI . 



108 109 

L E * • 

ScFv. . . £2££&£TAA TAAGAAIZ£ 
. Xhol EcoRI 



Fusion T 

12 3 4 
LEFD SNVQLQ. 
CTAGAATTTGATTCAAACGrffCAGCICGl^. 

32150 I PstI . 



ScFv. 



108 109 
. .L E * * 
. CTCGAC, TAA TAA^AAIIC 
. Xhol EcoRI 



Figure 6 (Page 4 of 5) 
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Fusion U 

12 3 4 108 109 

LSFDSTVOL 0. L E * • 

CTTAGTTTTGACAGCACAffTffCAGCTCCaff. • .ScFv. . . £I££AGTAATAAGMXI£ 
32190 I Pstl .. .Xhol EcoRI 



Fusion V 

12 3 4 108 109 

I D K L T L V Q L Q L E * * 

ATTGATAAGCTAACTTTGffrffCAGCICOa^- • .ScFv. . . CTCGA GTAA TAAGAA TTC 
32240 I Pstl .. .Xhol EcoRI 
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10 20 30 40 50 

M K Y L L P 
AAf;rTTr;rATf;r AAATTgTATTTPAAr;f;Ar;Ar.Ar:TCATAArGAA^ taccta TTGCCT 

HindlllSphI SD I 

60 70 80 90 100 110 

TAAAGLLLLAAQPAMARS Q 
ACGGCAGCCGCTGGATTGTTATTACTCGCTGCCCAACCAGCGATGGC CAGATrT n&r. 

pelB leader 1 B?lll\V H 

120 130 
L Q V D G S 

Pstl Sail EcoRl 



Figure 9 
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Figure 10 
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RSMKRARP 
TAILbBACK AGCTAGAICIATCAAGCGCGCAAGACCG 

Bglll 

FIBRE3FOR — fibre / — scFv 

PLNRARQVQLQ 
CCTCTCAAAAAAACCAAGCAGGTGCAGCIGCAGCAGCCTGG 

Pstl 



FIBRE 6 FOR — fibre / — scFv 

PLTVHDQVQLQ 
CCCGCTAACCGTGCACGACCAGGTGCAGCICCACCAGCCTGG 

Pstl 



FIBRE9FOR — fibre / — scFv 

PLTTATQVQLQ 
CCTCTAACTACTGCCACTCAGGTGCAG£ICCACCAGCCTGG 

Pstl 



FIBRE12FOR —fibre / — scFv 

GVTI NNQVQLQ 
GGTGTGACTATTAATAATCAGGTGCAGCXGCAGGACCCTGG 

Pstl 



FIBRE 15 FOR —fibre / — scFv 

PFDAQNQOVQLQ 
CCGTTTGATGCTCAAAACCAACAGGTGCAGCIGCAGCAGCC 

PSCI 



Figure 12 (Page 1 of 2) 
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FIBRE18FOR —fibre / — scFv 

GLMFDGQVQLQ 
GGGTTGATGTTTGACGCTCAGGTGCAGCI££AGCAGCC 

PSCI 



FIBRE21FOR —fibre /— scFv 

LSFDSTQVQLQ 
GCCTTAGTTTTGACAGCACACAGGTGCAGC1GCAGCAGCC 

PsCl 



FIBRE22FOR --fibre / — scFv 

GNKNNDKLTLQVQLQ 
GGAAACAAAAATAATGATAAGCTAACTTTGCAGGTGCAGCIGCAGCAGCC 

Psei 



FIBREPFOR — fibre 

Y I A Q E * 
CATACATTGCCCAAGAATAACAGGTGCAGCIGCAGCAGCCTGG 

PstI 
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10 7.0 30 in 50 

m k r /. /. /' 

AAC,rTTr.r ATCC AAATTrT AT-rrC A*aC.*G *CAGTCATAATGAAATACeTATTr:r/~'r 
Hindi I ISphl SD / 

60 70 00 90 100 1 10 
T A A A C L 1. t. L A A 0 I' A M A Q V Q 
ACCCCACCCGCTGGA TTGTTA TTACTCGCTGCCCAACCAGCGA TGGCCC AGGTGC AG 
pelB leader /VH NP 

120 130 140 ISO 160 170 

LQQPGAELVKPGASVKLSC 
C1GCACCAGCCTGGGGCTGAGCTTGTGAAGCCTGGGGCTTCAGTGAAGCTGTCCTGC 
Pscl 

180 190 200 210 220 2 

KASGYTFTSYWMHWVKQR P 

A&r:f:rTTrTr:f:rTArArrTTrArrAr;rTArTr:r:ATf:rACTc:f;f:Tf:AAr.rAf:Ar.f:rrT 

CDR1 

30 240 250 260 270 280 

GRGLEWIGRID. PNSGGTRY 
r:nArr:Af:r.rrTTf:Af:Tnf:ATTGGAAr:f;ATTnATf:rTAATAGT GGTGGTArTAAfTTAr . 

CDR2 

290 300 310 320 330 340 

NE K FLSKATLTVDK P SSTA 
AATGAGAAGTTCAAGAGC AACGCCACACTGACTGTAGACAAACCCTCCAGCACAGCC 

350 360 370 380 390 40 

YMQLSSLTSEDSAVYYCAR 
TACATGCAGCTCAGCAGCCTGACATCTGAGGACTCTGCGGTCTATTATTGTGCAAGA 

0 410 120 430 440 4 SO 

YD Y YGSSYFDYWGQGTTLT 
TArGATTArTArnnTAGTAnrTArTTTCAfTACTCGGGCCAAGGGAf-rAr GGTrArr 
CDR3 SscET I 

460 470 480 490 S00 510 

VS SGGGGSGGGGSGGGGSQ 
GTCTCCTC&GGTGGAGGCGGTTCAGGCGGAGGTGGCTCTGGCGGTGGCGGAICCCAG 

I (G«S) 3 linker BamH I - / v/A. 

520 530 540 550 560 r ,70 

V V L T Q E S A L T T 5 P G K T V T !. 
GCTGTTCTGACTCAGGAATCTCCACTCACCACATCACCTC;GTGAAACAGTCACACTr: 

S80 590 600 610 620 

TCRSSTGAVTTSNYANWVO 

Ar-rrr:Trf:rTrAAf:TArTnr,r,nrTrTTArAArT^rTAArTATrrrAArTr:r.r.TfrAA 

CDR1 
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G30 640 G50 GGO 670 600 

EKPOULFTCI. IGGTNNKAP 
GAAAAACCAGATCATTTATTCACTGGTCTAATAGGTfifiTfirfTAACAA(XGAfiCTrr/\ 

Kpnl CDR2 Sscl 

690 700 710 720 7 3° 740 

GVP ARFSGS LIGDKAA LT I 
GGTGTTCCTGCCAGATTCTCAGGCTCCCTGATTGGAGACAAGGCTGCCCTCACCATC 

750 760 770 780 790 8 

TGAQTEDEAirFCALWYSN 
ACAGGGGCACAGACTGAGGATGAGGCAATATATTTCTGT ffrTrTATnriTftrftfiCTAACT 

CDR3 

00 810 820 830 840 850 

HWVFGGGTKLTVLGLE • • 
rArTfffiCTn TTCGGTGGAGGAACCAAACTGACTGTCCTAGGTCIgiAGTAATAAGAA 

Xhol Eco 



TTC 
RI 
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